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SEQUENCE -DETERMINED DNA FRAGMENTS AND CORRESPONDING 
POLYPEPTIDES ENCODED THEREBY 

FIELD OF THE INVENTION 

The present invention relates to isolated 

polynucleotides that encode all, or a portion of, a gene that 
is expressed and the corresponding polypeptide. The present 
invention also relates to isolated polynucleotides that 
encode regulatory regions of genes. 

DESCRIPTION OF THE RELATED ART 

Efforts to map and sequence the genome of a number of 
organisms are in progress; a few complete genome sequences, for 
example those of E. coli and Saccharomyces cerevisiae are known 
(Blattner et al., Science 277 : 1453 (1997); Goffeau et al., 
Science 274 :546 (1996)). The complete genome of a 

multicellular organism, C. elegans, has also been sequenced 
(See, the C. elegans Sequencing Consortium, Science 282 : 2012 
(1998)). To date, no complete genome of a plant has been 
sequenced, nor has a complete cDNA complement of any plant been 
sequenced . 

SUMMARY OF THE INVENTION 

The present invention comprises polynucleotides, such as 
complete cDNA sequences and/or sequences of genomic DNA 
encompassing complete genes, portions of genes , and/or 
intergenic regions, hereinafter collectively referred to as 
"Sequence-Determined DNA Fragments" (SDFs) , from plants, 
particularly corn and Arabidopsis thaliana and polypeptides 
derived therefrom. In some instances, the SDFs span the 
entirety of a protein-coding segment. In some instances, the 
entirety of an mRNA is represented. Other objects of the 
invention are the control sequences, such as but not limited to 
promoters, that are also represented by SDFs of the invention. 
Complements of any sequence of the invention are also 
considered part of the invention. 
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Other objects of the invention are polynucleotides 
comprising exon sequences, polynucleotides comprising intron 
sequences, polynucleotides comprising introns together with 
exons, intron/exon junction sequences, 5' untranslated 
sequences, and 3' untranslated sequences of the SDFs of the 
present invention. Polynucleotides representing the joinder of 
any exons described herein, in any arrangement, for example, to 
produce a sequence encoding any desirable amino acid sequence 
are within the scope of the invention. 

The present invention also resides in probes useful for 
isolating and identifying nucleic acids that hybridize to an 
SDF of the invention. The probes are typically of a length of 
12 to 2000 nucleotides long; more typically, 15 to 200 
nucleotides long; even more typically, 18 to 100 nucleotides 
long . 

Yet another object of the invention is a method of 
isolating and/or identifying nucleic acids using the following 
steps : 

(a) contacting a probe of the instant invention with a 
polynucleotide sample under conditions that permit 
hybridization and formation of a polynucleotide duplex; and 

(b) detecting and/or isolating the duplex of step (a) . 
The conditions for hybridization can be from low to 

moderate to high stringency conditions. The sample can include 
a polynucleotide having a sequence unique in a plant genome. 
Probes and methods of the invention are useful, for example, 
without limitation, for mapping of genetic traits and/or for 
positional cloning of a desired portion of genomic DNA. 

Probes and methods of the invention can also be used for 
detecting alternatively spliced messages within a species. 
Probes and methods of the invention can further be used to 
detect or isolate related genes in other plant species using 
genomic DNA (gDNA) and/or cDNA libraries. In some instances, 
especially when longer probes and low to moderate stringency 
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hybridization conditions are used, the probe will hybridize to 
a plurality of cDNA and/or gDNA sequences of a plant. This 
approach is useful for isolating representatives of gene 
families which are identifiable by possession of a common 
functional domain in the gene product or which have common cis- 
acting regulatory sequences. This approach is also useful for 
identifying orthologous genes from other organisms, which can 
be more or less related to corn, Arabidopsis, or another plant. 

The present invention also resides in constructs for 
modulating the expression of the genes comprised of all or a 
portion of an SDF. The constructs comprise all or a portion 
of the expressed SDF, or of a complementary sequence. 
Examples of constructs include ribozymes comprising RNA 
encoded by an SDF or by a sequence complementary thereto, 
antisense constructs, constructs comprising coding regions or 
parts thereof, constructs comprising promoters, introns, 
untranslated regions, etc. When inserted into a host cell 
the construct is, preferably, functionally integrated with or 
operatively linked to a heterologous polynucleotide. For 
instance, a coding region from an SDF might be operably 
linked to a promoter that is functional in a plant. 

The present invention also resides in host cells, 
including bacterial or yeast cells or plant cells, and 
transgenic plants that harbor constructs such as described 
above. Another aspect of the invention relates to methods for 
modulating expression of specific genes in transgenic plants by 
expression of the structural gene component of the constructs, 
by regulation of expression of one or more endogenous genes in 
a transgenic plant or by suppression of expression of the 
polynucleotides of the invention in a transgenic plant. 
Methods of modulation of gene expression include without 
limitation (1) inserting into a host cell additional copies of 
a polynucleotide comprising a coding sequence; (2) modulating 
an endogenous promoter in a host cell; (3) inserting antisense 
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or ribozyme constructs into a host cell and (4) inserting into 
a host cell a polynucleotide comprising a sequence encoding a 
mutant, fragment, or fusion of the native polypeptides of the 
instant invention . 



BRIEF DESCRIPTION OF THE TABLES 

The sequences of exemplary SDFs and polypeptides encoded 
thereby of the instant invention are listed in SEQ TABLES 1 
and 2; annotation relevant to the sequences shown in SEQ 
TABLES 1 and 2 is presented in REF TABLES 1 and 2. Each 
sequence corresponds to a Maximum Length cDNA Polynucleotide 
Sequence. Each Maximum Length cDNA Polynucleotide Sequence 
(Ac) corresponds to at least one sequence in SEQ TABLE 1 and 
2. REF TABLE 1 corresponds with SEQ TABLE 1; REF TABLE 2 
corresponds with SEQ TABLE 2. 

REF TABLES 1 and 2 are Reference Tables which correlate 
each of the sequences and SEQ ID NOS in SEQ TABLES 1 and 2 
with a corresponding Maximum Length cDNA Sequence (Ac) , Ceres 
(Applicant) sequence identifier and other information about 
the individual sequence. SEQ TABLES 1 and 2 are Sequence 
Tables containing the sequence of each nucleic acid and amino 
acid sequence. 

In REF TABLES 1 and 2, each section begins by 
identifying the Maximum Length cDNA Polynucleotide Sequence, 
indicating a "Clone ID 7 ' that is a number used for 
identification purposes by the applicant and in some 
instances a "Public Genomic DNA" sequence, indicated by a "gi 
No" . In those instances where a public sequence is recited, 
there follows information about gene annotations such as 
predicted exons. In this portion, after the description of 
the gene's functional unit, the starting and ending 
nucleotide number of the public sequence and the computer 
program used to generate the result are listed. "INIT" 
denotes an initial exon. " INTR" denotes an internal exon. 
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"TERM" denotes a terminal exon. For those sequences where 
the computer program is identified as "OCKHAM" , " INIT" 
denotes the first detected exon and "TERM" denotes the last 
detected exon. "SINGLE" denotes the entire mRNA coding 
5 portion is in one exon. Other notations are considered 
intuitive to the skilled artisan. 

In the next section (Ac) , the cDNA MLS is identified by 
its SEQ ID NO ("Pat. Appln. SEQ ID NO") and the Ceres 
sequence identifier ("Ceres seq_id" ) , which is also merely an 

10 identifier useful for the applicant. The designation of 
"Alternative transcription start sites" can include both 
positive and negative numbers. Positive numbers refer to the 
referenced SEQ ID NO directly. The positions indicated by 
negative numbers, if any, refer to positions in the public 

15 genomic sequences. In instances where there is a "Public 

Genomic DNA" referenced, the relevant genomic sequence can be 
found by direct reference to the nucleotide sequence 
indicated by the "gi" number shown for the Public Genomic 
DNA. In instances where there is no Public Genomic DNA 

20 referenced, the relevant nucleotide sequence for alignment is 
the nucleotide sequence associated with the amino acid 
sequence designated by a "gi" number in the section (Dp). In 
these instances, the nucleotide sequence is found in GENBANK 
by clicking on the link in the National Center for 

25 Biotechnology Information Entrez database. The numbering is 
relative to position 1 as determined by aligning the first 
residue of the MLS cDNA sequence ( SEQ ID NO *) with the 
genomic sequence corresponding to the relevant "gi" number. 

Subsection (B) lists SEQ ID NOS and Ceres seq_ids for 

30 polypeptide sequences encoded by the . cDNA sequence and the 
location of the start codon within the cDNA sequence that 
codes for the polypeptide. Subsection (B) also describes 
additional features within the polypeptide sequence. 
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Subsection (C) provides information regarding identified 
domains (where present) within the polypeptide and (where 
present) a name for the polypeptide. Subsection (Dp) provides 
(where present) information concerning amino acid sequences 
5 that are found to be related and have some sequence identity 
to the polypeptide sequences of SEQ TABLES 1 and 2. Those 
"related" sequences are identified by a xv gi" number and are 
amino acid sequences in the publicly accessible BLAST 
databases on the NCBI FTP web site (accessible at 

10 ncbi.nlm.gov/blast). The database at the NCBI FTP site 
utilizes the "gi" identifiers to assign by NCBI a unique 
identifier for each sequence in the databases, thereby 
providing a non-redundant database for sequences from various 
databases, including GenBank, EMBL f DBBJ (DNA Database of 

15 Japan) and PDB (Brookhaven Protein Data Bank) . Subsection 
(Ba) , when present, describes a sequence as being considered 
plant-specific (i.e. a gene found only in a plant) or 
describes a bichemical activity for the protein encoded by 
the exemplary SDF. Subsection (Dn) provides polynucleotide 

20 sequences (where present) related to the Maximum Length cDNA 
sequence . 

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to (I) polynucleotides and methods 
of use thereof, such as 
25 IA. Probes, Primers and Substrates; 



IB. 



Methods of Detection and Isolation; 



30 



IC. 



B. 1 . Hybridization; 

B.2. Methods of Mapping; 

B.3. Southern Blotting; 

B.4. Isolating cDNA from Related Organisms; 

B.5. Isolating and/or Identifying Orthologous Genes 

Methods of Inhibiting Gene Expression 



C.l. 



Antisense 
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C.2. Ribozyme Constructs; 
C. 3 . Co-Suppression; 

C.4. Other Methods to Inhibit Gene Expression 

ID. Methods of Functional Analysis; 

IE. Promoter Sequences and Their Use; 

IF. UTR' s and/or Intron Sequences and Their Use; and 

IG. Coding Sequences and Their Use. 



The specification also discloses (II) polypeptides 
10 including, without limitation, native proteins, mutants, 
fragments, and fusions. Antibodies to said polypeptides are 
also disclosed. 

The specification also discloses (III) methods of 
modulating polypeptide production or activity. Examples of such 
15 methods include (i) suppressed, (ii) enhanced, and (iii) 
directed expression . 

The specification also discloses (IV) gene constructs and 
construction of expression vectors, including (IVA) coding 
sequences and (IVB) promoters, and (IVC) Signal Peptides, (V) 
20 transformation procedures to illustrate the invention by way of 
examples . 

I . Polynucleotides 

A number of the nucleotide sequences disclosed in SEQ 
TABLES 1 AND 2 herein as representative of the SDFs of the 

2 5 invention can be obtained by sequencing genomic DNA (gDNA) 

and/or cDNA from corn plants grown from HYBRID SEED # 35A19, 
purchased from Pioneer Hi-Bred International, Inc., Supply 
Management, P.O. Box 256, Johnston, Iowa 50131-0256. 

Exemplified SDFs of the invention represent portions of 

3 0 the genome of corn or Arabidopsls and/or represent mRNA 

expressed from that genome. The isolated nucleic acid of the 
invention also encompasses corresponding portions of the genome 
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and/or cDNA complement of other organisms as described in 
detail below. 

Starting material for cDNA synthesis for the exemplary 
corn cDNA clones having sequences presented in SEQ TABLES 1 
5 AND 2 was poly (A) -containing polysomal mRNAs from 
inflorescences and root tissues of corn plants grown from 
HYBRID SEED # 35A19. Male inflorescences and female (pre-and 
post-fertilization) inflorescences were isolated at various 
stages of development. Selection for poly (A) containing 
10 polysomal RNA was done using oligo d(T) cellulose columns, as 
described by Cox and Goldberg, "Plant Molecular Biology: A 
Practical Approach", pp. 1-35, Shaw ed. , c- 1988 by IRL, 
Oxford . 

Tissues were or each organ was individually pulverized 
15 and frozen in liquid nitrogen. Next, the samples were 
homogenized in the presence of detergents and then 
centrifuged. The debris and nuclei were removed from the 
sample and more detergents were added to the sample. The 
sample was centrifuged and the debris was removed. Then the 
20 sample was applied to a 2M sucrose cushion to isolate 
polysomes. The RNA was isolated by treatment with detergents 
and proteinase K followed by ethanol precipitation and 
centrif ugat ion . The polysomal RNA from the different tissues 
was pooled according to the following mass ratios: 15/15/1 
25 for male inflorescences, female inflorescences and root, 
respectively. The pooled material was then used for cDNA 
synthesis by the methods described below. 

The quality and the integrity of the polyA+ RNAs were 
evaluated. 

30 A number of the nucleotide sequences disclosed in SEQ 

TABLES 1 AND 2 herein as representative of the SDFs of the 
invention can also be obtained by sequencing genomic DNA from 
Arabidopsls thaliana, Wassilewski j a ecotype or by sequencing 
cDNA obtained from mRNA from such plants as described below. 
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This is a true breeding strain. Seeds of the plant are 
available from the Arabidopsis Biological Resource Center at 
the Ohio State University, under the accession number CS2360. 
Seeds of this plant were deposited under the terms and 
5 conditions of the Budapest Treaty at the American Type Culture 
Collection, Manassas, VA on August 31, 1999, and were assigned 
ATCC No. PTA-595. 

Starting material for cDNA synthesis for the exemplary 
Arabidopsis cDNA clones having sequences presented in SEQ 

10 TABLES 1 AND 2 was polysomal RNA isolated from the top-most 
inflorescence tissues and roots of Arabidopsis thaliana 
Landsberg erecta (L. er.) also obtained from the Arabidopsis 
Biological Resource Center. Nine parts inflorescence to 
every part root was used, as measured by mass. Tissue was 

15 pulverized and exposed to liquid nitrogen. Next, the sample 
was homogenized in the presence of detergents and then 
centrifuged. The debris and nuclei were removed from the 
sample and more detergents were added to the sample. The 
sample was centrifuged c.nd the debris was removed and the 

20 sample was applied to a 2M sucrose cushion to isolate 
polysomal RNA. Cox et al., "Plant Molecular Biology: A 
Practical Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, 
Oxford. The polysomal RNA was used for cDNA synthesis by 
the methods described below. Polysomal mRNA was then isolated 

25 as described above for corn cDNA. The quality of the RNA was 
assessed elect ropho ret ically . 

Following preparation of the mRNAs from various tissues 
as described above, selection of mRNA with intact 5' ends and 
specific attachment of an oligonucleotide tag to the 5' end of 

30 such mRNA was performed using either a chemical or enzymatic 
approach. Both techniques take advantage of the presence of 
the "cap" structure, which characterizes the 5' end of most 
intact mRNAs and which comprises a guanosine generally 
methylated once, at the 7 position. 
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The chemical modification approach involves the optional 
elimination of the 2', 3'-cis diol of the 3' terminal ribose, 
the oxidation of the 2', 3'-cis diol of the ribose linked to 
the cap of the 5' ends of the mRNAs into a dialdehyde, and the 
5 coupling of the such obtained dialdehyde to a derivatized 
oligonucleotide tag. Further detail regarding the chemical 
approaches for obtaining mRNAs having intact 5' ends are 
disclosed in International Application No. W096/34981 published 
November 7, 1996. 

10 The enzymatic approach for ligating the oligonucleotide 

tag to the intact 5' ends of mRNAs involves the removal of the 
phosphate groups present on the 5' ends of uncapped incomplete 
mRNAs , the subsequent decapping of mRNAs having intact 5' ends 
and the ligation of the phosphate present at the 5' end of the 

15 decapped mRNA to an oligonucleotide tag. Further detail 
regarding the enzymatic approaches for obtaining mRNAs having 
intact 5' ends are disclosed in Dumas Milne Edwards J.B. 
(Doctoral Thesis of Paris VI University, Le clonage des ADNc 
complets: difficultes et perspectives nouvelles. Apports pour 

20 l f etude de la regulation de 1' expression de la tryptophane 
hydroxylase de rat, 20 Dec. 1993), EP0 625572 and Kato et al., 
Gene 15_0:243-250 (1994). 

In both the chemical e^and the enzymatic approach, the 
oligonucleotide tag has a restriction enzyme site (e.g. an 

25 EcoRI site) therein to facilitate later cloning procedures. 
Following attachment of the oligonucleotide tag to the mRNA, 
the integrity of the mRNA is examined by performing a Northern 
blot using a probe complementary to the oligonucleotide tag. 

For the mRNAs joined to oligonucleotide tags using either 

30 the chemical or the enzymatic method, first strand cDNA 
synthesis is performed using an oligo-dT primer with reverse 
transcriptase. This oligo-dT primer can contain an internal 
tag of at least 4 nucleotides, which can be different from one 
mRNA preparation to another. Methylated dCTP is used for cDNA 
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first strand synthesis to protect the internal EcoRI sites from 
digestion during subsequent steps. The first strand cDNA is 
precipitated using isopropanol after removal of RNA by alkaline 
hydrolysis to eliminate residual primers. 
5 Second strand cDNA synthesis is conducted using a DNA 

polymerase, such as Klenow fragment and a primer corresponding 
to the 5' end of the ligated oligonucleotide. The primer is 
typically 20-25 bases in length. Methylated dCTP is used for 
second strand synthesis in order to protect internal EcoRI 

10 sites in the cDNA from digestion during the cloning process. 

Following second strand synthesis, the full-length cDNAs 
are cloned into a phagemid vector, such as pBlueScript™ 
(Stratagene) . The ends of the full-length cDNAs are blunted 
with T4 DNA polymerase (Biolabs) and the cDNA is digested with 

15 EcoRI. Since methylated dCTP is used during cDNA synthesis, 
the EcoRI site present in the tag is the only hemi-methylated 
site; hence the only site susceptible to EcoRI digestion. In 
some instances, to facilitate subcloning, an Hind III adapter 
is added to the 3' end of full-length cDNAs. 

20 The full-length cDNAs are then size fractionated using 

either exclusion chromatography (AcA, Biosepra) or 
elect rophoretic separation which yields 3 to 6 different 
fractions. The full-length cDNAs are then directionally cloned 
either into pBlueScript™ using either the EcoRI and Smal 

25 restriction sites or, when the Hind III adapter is present in 
the full-length cDNAs, the EcoRI and Hind III restriction 
sites. The ligation mixture is transformed, preferably by 
electroporation, into bacteria, which are then propagated under 
appropriate antibiotic selection. 

30 Clones containing the oligonucleotide tag attached to 

full-length cDNAs are selected as follows. 

The plasmid cDNA libraries made as described above are 
purified (e.g. by a column available from Qiagen) . A positive 
selection of the tagged clones is performed as follows. 
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Briefly, in this selection procedure, the plasmid DNA is 
converted to single stranded DNA using phage Fl gene II 
endonuclease in combination with an exonuclease (Chang et al . , 
Gene 127 : 95 (1993)) such as exonuclease III or T7 gene 6 
5 exonuclease. The resulting single stranded DNA is then 
purified using paramagnetic beads as described by Fry et al . , 
Biotechniques 13_: 124 (1992) . Here the single stranded DNA is 
hybridized with a biotinylated oligonucleotide having a 
sequence corresponding to the 3' end of the oligonucleotide 

10 tag. Preferably, the primer has a length of 20-25 bases. 
Clones including a sequence complementary to the biotinylated 
oligonucleotide are selected by incubation with streptavidin 
coated magnetic beads followed by magnetic capture. After 
capture of the positive clones, the plasmid DNA is released 

15 from the magnetic beads and converted into double stranded DNA 
using a DNA polymerase such as ThermoSequenase™ (obtained from 
Amersham Pharmacia Biotech) . Alternatively, protocols such as 
the Gene Trapper™ kit (Gibco BRL) cat be used. The double 
stranded DNA is then transformed, preferably by 

20 electroporation, into bacteria. The percentage of positive 
clones having the 5' tag oligonucleotide is typically estimated 
to be between 90 and 98% from dot blot analysis. 

Following transformation, the libraries are ordered in 
microtiter plates and sequenced. The Arabldopsis library was 
deposited at the American Type Culture Collection on January 
7, 2000 as "E-coll liba 010600" under the accession number 



Other methods for cloning full-length cDNA are 
described, for example, by Seki et al . , Plant Journal 15:707- 
25 720 (1998) High efficiency cloning of Arabidopsis full- 
length cDNA by biotinylated Cap trapper"; Maruyama et al . , 
Gene 138 : 171 (1994) " Oligo-capping a simple method to replace 
the cap structure of eukaryotic mRNAs with 

oligoribonucleotides" ; and WO 96/34981. 
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It is contemplated that the nucleotide sequences 
presented herein may contain some small percentage of errors. 
These errors may arise in the normal course of determination of 
nucleotide sequences. Sequence errors can be corrected by 
5 obtaining seeds deposited under the accession numbers cited 
above, propagating them, isolating genomic DNA or appropriate 
mRNA from the resulting plants or seeds thereof, amplifying the 
relevant portion of the genomic DNA or mRNA using primers 
having a sequence that flanks the erroneous sequence, and 
10 sequencing the amplification product. 



I .A. Probes, Primers and Substrates 

SDFs of the invention can be applied to substrates for 
use in array applications such as, but not limited to, assays 
of global gene expression, for example under varying 

15 conditions of development, growth conditions. The arrays can 
also be used in diagnostic or forensic methods. 

Probes and primers of the instant invention will 
hybridize to a polynucleotide comprising a sequence in SEQ 
TABLES 1 AND 2. Though many different nucleotide sequences 

20 can encode an amino acid sequence, in some instances, the 
sequences of SEQ TABLES 1 AND 2 are preferred for encoding 
polypeptides of the invention. However, the sequence of the 
probes and/or primers of the instant invention need not be 
identical to those in SEQ TABLES 1 AND 2 or the complements 

25 thereof. For example, some variation in probe or primer 
sequence and/or length can allow additional family members to 
be detected, as well as orthologous genes and more 
taxonomically distant related sequences. Similarly probes 
and/or primers of the invention can include additional 

30 nucleotides that serve as a label for detecting the formed 
duplex or for subsequent cloning purposes. 

Probe length will vary depending on the application. 
For use as PCR primers, probes should be 12-40 nucleotides, 
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preferably 18-30 nucleotides long. For use in mapping, 
probes should be 50 to 500 nucleotides, preferably 100-250 
nucleotides long. For Southern hybridizations, probes as 
long as several kilobases can be used as explained below. 
5 The probes and/or primers can be produced by synthetic 

procedures such as the triester method of Matteucci et al . J. 
Am. Chem. Soc. 103 : 3185 ( 1981); or according to Urdea et al. 
Proc. Natl. Acad. 8_0:7461 (1981) or using commercially 
available automated oligonucleotide synthesizers. 

10 



I . B . Methods of Detection and Isolation 

B. 1 . Hybridization 
Probes and/or primers can be used for detection and/or 

15 isolation of polynucleotide sequences. Such polynucleotides are 
included in the subject matter of the invention. Depending on 
the stringency of the conditions under which these probes 
and/or primers are used, polynucleotides exhibiting a wide 
range of similarity to those in SEQ TABLES 1 AND 2 can be 

20 detected or isolated. 

"Stringency" is a function of probe length, probe 
composition (G + C content) , and salt concentration, organic 
solvent concentration, and temperature of hybridization or wash 
conditions. Stringency is typically compared by the parameter 

25 "T m ", which is the temperature at which 50% of the 
complementary molecules in the hybridization are hybridized, in 
terms of a temperature differential from T m . High stringency 
conditions are those providing a condition of T m - 5°C to T m - 
10°C. Medium stringency conditions are those providing T m - 

30 20°C to T m - 29°C. Low stringency conditions are those 
providing a condition of T m - 40°C to T m - 48°C. The 
relationship of hybridization conditions to T m (in °C) is 
expressed in the mathematical equation 
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T m = 81.5 -16.6(log 10 [Na + ] ) + 0.41(%G+C) - (600/N) (1) 

where N is the length of the probe. This equation works well 
for probes 14 to 70 nucleotides in length that are identical to 
the target sequence. The equation below for T m of DNA-DNA 
5 hybrids is useful for probes in the range of 50 to greater than 
500 nucleotides, and for conditions that include an organic 
solvent (formamide) . 

T m = 81.5 + 16.6 log { [Na + ] / { 1 + 0 . 7 (Na + ] ) } + 0 . 4 1 { %G+C) -500/L 0 . 63 (%f ozmcmide) (2) 

where L is the length of the probe in the hybrid. (P. 
10 Tijessen, " Hybridization with Nucleic Acid Probes" in 
Laboratory Techniques in Biochemistry and Molecular Biology , 
P.C. vand der Vliet, ed. , c. 1993 by Elsevier, Amsterdam.) 
The T m of equation (2) is affected by the nature of the 
hybrid; for DNA-RNA hybrids T m is 10-15°C higher than 
calculated, for RNA-RNA hybrids T m is 20-25°C higher. Because 
the T m decreases about 1 °C for each 1% decrease in homology 
when a long probe is used (Bonner et al., <J. Mol . Biol. 
8_1:123 (1973)), stringency conditions can be adjusted to 
favor detection of identical genes or related family members. 

Equation (2) is derived assuming equilibrium and 
therefore, hybridizations according to the present invention 
are most preferably performed under conditions of probe 
excess and for sufficient time to achieve equilibrium. The 
time required to reach equilibrium can be shortened by 
inclusion of a " hybridization accelerator" such as dextran 
sulfate or another high volume polymer in the hybridization 
buffer . 

When the practitioner wishes to examine the result of 
membrane hybridizations under a variety of stringencies, an 
efficient way to do so is to perform the hybridization under 
a low stringency condition, then to wash the hybridization 
membrane under increasingly stringent conditions. With 



20 



25 
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respect to wash steps, preferred stringencies lie within the 
ranges stated above; high stringency is 5-8°C below T m , medium 
stringency is 26-29°C below T m and low stringency is 45-48°C 
below T m . 

5 A number of methods known to those skilled in the art 

can be used with the probes and/or primers of the invention 
to isolate and detect polynucleotides, including, without 
limitation: Southerns, Northerns, Branched DNA hybridization 
assays, polymerase chain reaction, and variations thereof. 

10 When using SDFs to identify orthologous genes in other 

species, the practitioner will preferably adjust the amount 
of target DNA of each species so that, as nearly as is 
practical, the same number of genome equivalents are present 
for each species examined. This prevents faint signals from 

15 species having large genomes, and thus small numbers of 
genome equivalents per mass of DNA, from erroneously being 
interpreted as absence of the corresponding gene in the 
genome . 

A good general discussion of the factors for determining 
20 hybridization conditions is provided by Sambrook et al. 

("Molecular Cloning, a Laboratory Manual, 2nd ed. , c. 1989 by 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY; 
see esp. , chapters 11 and 12). Additional considerations and 
details of the physical chemistry of hybridization are provided 
25 by G.H. Keller and M.M. Manak "DNA Probes", 2 nd Ed. pp. 1-25, 
c. 1993 by Stockton Press, New York, NY. 

Hybridization of one nucleic acid to another constitutes 
a physical property that defines the subject SDF of the 
invention. Also, such hybridization imposes structural 

30 limitations on the pair. For example, for a probe molecule, 
given that the sequence of the probe nucleic acid is known 
and fixed, equation (2) indicates that the combined variation 
in GC content of the target DNA and mismatch between the 



WO 00/40695 PCTYUSOO/00466 

17 

probe and the hybridizing DNA is determined for any given 
hybridization buffer composition and T m . 

The probes and/or primers of the instant invention can 
be used to detect or isolate nucleotides that are "identical'' 
5 to the probes or primers. Two nucleic acid sequences or 
polypeptides are said to be "identical" if the sequence of 
nucleotides or amino acid residues, respectively, in the two 
sequences is the same when aligned for maximum correspondence 
as described below. The term "complementary to" is used 
10 herein to mean that the sequence can form a Watson-Crick base 
pair with a reference polynucleotide sequence. Complementary 
sequences can include nucleotides, such as inosine, that 
neither disrupt Watson-Crick base pairing nor contribute to 
the pairing. 

15 Optimal alignment of sequences for comparison may be 

conducted by the local homology algorithm of Smith and Waterman 
Add. APL . Math. 2:482 (1981), by the homology alignment 
algorithm of Needleman and Wunsch J. Mol. Biol. 48_:443 (1970), 
by the search for similarity method of Pearson and Lipman Proc. 

20 Natl. Acad. Sci. (USA) 815 : 2444 (1988), by computerized 
implementations of these algorithms (GAP, BESTFIT, BLAST, 
PASTA, and T FAST A in the Wisconsin Genetics Software Package, 
Genetics Computer Group (GCG) , 575 Science Dr., Madison, WI), 
or by inspection. Given that two sequences have been 

25 identified for comparison, GAP and BESTFIT are preferably 
employed to determine their optimal alignment. Typically, the 
default values of 5.00 for gap weight and 0.30 for gap weight 
length are used. 

The probes and/or primers of the invention can also be 

30 used to detect and/or isolate polynucleotides exhibiting at 
least 80% sequence identity with the sequences of SEQ TABLES 1 
AND 2 or fragments thereof. 

"Percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a comparison 
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window, wherein the portion of the polynucleotide sequence in 
the comparison window may comprise additions or deletions 
(e.g., gaps or overhangs) as compared to the reference sequence 
(which does not comprise additions or deletions) for optimal 
5 alignment of the two sequences. The percentage is calculated 
by determining the number of positions at which the identical 
nucleic acid base or amino acid residue occurs in both 
sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of 

10 positions in the window of comparison and multiplying the 
result by 100 to yield the percentage of sequence identity. 
"Percentage of sequence identity" can be determined by the 
algorithms described above. 

The term "substantial identity" between polynucleotide or 

15 polypeptide sequences refers to polynucleotide or polypeptide 
comprising a sequence that has at least 80% sequence identity, 
preferably at least 85%, more preferably at least 90% and most 
preferably at least 95%, even more preferably, at least 96%, 
■ 97%, 98% or 99% sequence identity compared to a reference 

20 sequence using the programs. 

Isolated polynucleotides within the scope of the 
invention also include allelic variants of the specific 
sequences presented in SEQ TABLES 1 AND 2. An "allelic 
variant" is a sequence that is a variant from that of the 

25 SDF, but represents the same chromosomal locus in the 
organism. Allelic variants can arise by normal genetic 
variation in a population. Allelic variants can also be 
produced by genetic engineering methods. An allelic variant 
can be one that is found in a naturally occurring plant, 

30 including a cultivar or ecotype. A silent allele can give 
rise to phenotypic and expression profiles. An allelic 
variant may or may not give rise to a phenotypic change, and 
may or may not be expressed. An expressed allele can result 
in a detectable change in the phenotype of the trait 
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represented by the locus. Allelic variations can occur in 
any portion of the gene sequence, including regulatory 
regions as well as structural regions. 

With respect to nucleotide sequences, degeneracy of the 
5 genetic code provides the possibility to substitute at least 
one base of the base sequence of a gene with a different base 
without causing the amino acid sequence of the polypeptide 
produced from the gene to be changed. Hence, the DNA of the 
present invention may also have any base sequence that has 
10 been changed from a sequence in SEQ TABLES 1 AND 2 by 
substitution in accordance with degeneracy of genetic code. 
References describing codon usage include: Carels et al., J. 
Mol. Evol. 46_: 45 (1998) and Fennoy et al., Nucl. Acids Res. 
21 (23) : 5294 (1993) . 

15 A. B.2. Mapping 

The isolated SDF DNA of the invention can be used to 
create various types of genetic and physical maps of the 
genome of corn, Arabidopsis or other plants. Some SDFs may 
be absolutely associated with particular phenotypic traits, 

20 allowing construction of gross genetic maps. While not all 
SDFs will immediately be associated with a phenotype, all 
SDFs can be used as probes for identifying polymorphisms 
associated with phenotypes of interest. Briefly, total DNA 
is isolated from individuals and is subsequently cleaved with 

25 one or more restriction enzymes, separated according to mass, 
transferred to a solid support, hybridized with SDF DNA and 
the pattern of fragments compared. Polymorphisms associated 
with a particular SDF are visualized as differences in the 
size of fragments produced between individual DNA samples 

30 after digestion with a particular restriction enzyme and 
hybridization with the SDF. After identification of 
polymorphic SDF sequences, linkage studies can be conducted. 
Recombinants produced are analyzed using the same restriction 
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enzyme/hybridization procedure. After identification of 
many polymorphisms using SDF sequences, linkage studies can 
be conducted by using the individuals showing polymorphisms 
as parents in crossing programs. F2 progeny recombinants or 
recombinant inbreds, for example, are then analyzed using the 
same restriction enzyme/hybridization procedure. The order 
of DNA polymorphisms along the chromosomes can be inferred 
based on the frequency with which they are inherited together 
versus independently. The closer two polymorphisms are 
together in a chromosome the higher the probability that they 
are inherited together. Integration of the relative 
positions of all the polymorphisms and associated marker SDFs 
produces a genetic map of the species, where the distances 
between markers reflect the recombination frequencies in that 
chromosome segment . 

The use of recombinant inbred lines for such genetic 
mapping is described for Arabidopsis by Alonso-Blanco et al. 
(Methods in Molecular Biology, vol.82, "Arabidopsis 
Protocols" , pp. 137-146, J.M. Mart inez-Zapater and J. 
Salinas, eds . , c. 1998 by Humana Press, Totowa, NJ) and for 
corn by Burr ("Mapping Genes with Recombinant Inbreds" , pp. 
249-254. In Freeling, M. and V. Walbot (Ed.), The Maize 
Handbook, c. 1994 by Springer-Verlag New York, Inc. : New 
York, NY, USA; Berlin Germany; Burr et al. Genetics (1998) 
118: 519; Gardiner, J. et al., (1993) Genetics 134: 917). 
However, this procedure is not limited to plants and can be 
used for other organisms (such as yeast) or for individual 
cells . 

The SDFs of the present invention can also be used for 
simple sequence repeat (SSR) mapping. Rice SSR mapping is 
described by Morgante et al. (The Plant Journal (1993) 3: 
165), Panaud et al . (Genome (1995) 38: 1170); Senior et al. 
(Crop Science (1996) 36: 1676), Taramino et al . (Genome 
(1996) 39: 277) and Ahn et al . {Molecular and General 
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Genetics (1993) 241: 483-90). SSR mapping can be achieved 
using various methods. In one instance, polymorphisms are 
identified when sequence specific probes flanking an SSR 
contained within an SDF are made and used in polymerase chain 
reaction (PCR) assays with template DNA from two or more 
individuals of interest. Here, a change in the number of 
tandem repeats between the SSR-flanking sequence produces 
differently sized fragments (U.S. Patent 5,766,847). 
Alternatively, polymorphisms can be identified by using the 
PCR fragment produced from the SSR-flanking sequence specific 
primer reaction as a probe against Southern blots 
representing different individuals (U.H. Refseth et al., 
(1997) Electrophoresis 18: 1519). 

Genetic and physical maps of crop species have many 
uses. For example, these maps can be used to devise 
positional cloning strategies for isolating novel genes from 
the mapped crop species. In addition, because the genomes of 
closely related species are largely syntenic (that is, they 
display the same ordering of genes within the genome) , these 
maps can be used to isolate novel alleles from wild relatives 
of crop species by positional cloning strategies. 

The various types of maps discussed above can be used 
with the SDFs of the invention to identify Quantitative Trait 
Loci (QTLs). Many important crop traits, such as the solids 
content of tomatoes, are quantitative traits and result from 
the combined interactions of several genes. These genes 
reside at different loci in the genome, oftentimes on 
different chromosomes, and generally exhibit multiple alleles 
at each locus. The SDFs of the invention can be used to 
identify QTLs and isolate specific alleles as described by de 
Vicente and Tanksley {Genetics 134 : 585 (1993)). In addition 
to isolating QTL alleles present crop species, the SDFs of 
the invention can also be used to isolate alleles from the 
corresponding QTL of wild relatives. Transgenic plants 
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having various combinations of QTL alleles can then be 
created and the effects of the combinations measured. Once 
an ideal allele combination has been identified, crop 
improvement can be accomplished either through 
biotechnological means or by directed conventional breeding 
programs (for review see Tanksley and McCouch, Science 
211: 1063 (1997) ) . 

In another embodiment the SDFs can be used to help 
create physical maps of the genome of corn, Arabidopsis and 
related species. Where SDFs have been ordered on a genetic 
map, as described above, then SDFs can be used as probes to 
discover which clones in large libraries of plant DNA 
fragments in YACs, BACs, etc. contain the same SDF or similar 
sequences, thereby facilitating the assignment of the large 
DNA fragments to chromosomal positions. Subsequently, the 
large BACs, YACs, etc. can be ordered unambiguously by more 
detailed studies of their sequence composition (e.g. Marra et 
al. (1997) Genomic Research 7:1072-1084) and by using their 
end or other sequences to find the identical sequences in 
other cloned DNA fragments. The overlapping of DNA 
sequences in this way allows large contigs of plant sequences 
to be built, that, when sufficiently extended, provide a 
complete physical map of. a chromosome. Sometimes the SDFs 
themselves will provide the means of joining cloned sequences 
into a contig. 

The patent publication WO95/35505 and U.S. Patents 
5,445,943 and 5,410,270 describe scanning multiple alleles of 
a plurality of loci using hybridization to arrays of 
oligonucleotides. These techniques are useful for each of the 
types of mapping discussed above. 

Following the procedures described above and using 
a plurality of the SDFs of the present invention, any 
individual can be genotyped. These individual genotypes can 
be used for the identification of particular cultivars, 
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varieties, lines, ecotypes and genetically modified plants or 
can serve as tools for subsequent genetic studies involving 
multiple phenotypic traits. 

B . 3 Southern Blot Hybridization 

The sequences from SEQ TABLES 1 AND 2 can be used as 
probes for various hybridization techniques. These 
techniques are useful for detecting target polynucleotides in 
a sample or for determining whether transgenic plants, seeds 
or host cells harbor a gene or sequence of interest and thus 
might be expected to exhibit a particular trait or phenotype . 

In addition, the hybridization of the SDFs of the 
invention to nucleic acids obtained from other organisms can 
be used to identify orthologous genes from other species 
and/or additional members of gene families either in the same 
or different species. In regard to identifying genes in 
other species, a Southern blot of genomic DNA provides 
description of isolated DNA fragments that comprise the 
orthologous genes or additional members of the gene families. 
That is, given such data, one of ordinary skill in the art 
could distinguish the isolated DNA fragments by their size 
together with the restriction sites at each end and by the 
property of hybridizing with the SDF probe under the stated 
conditions . 

In addition, the SDFs from the invention can be used to 
isolate additional members of gene families from the same 
species and/or orthologous genes from different species. 
This is accomplished by hybridizing an SDF to a Southern blot 
containing the appropriate genomic DNA or cDNA. Given the 
resulting hybridization data, one of ordinary skill in the 
art could distinguish and isolate the correct DNA fragments 
by size, restriction sites and stated hybridization 
conditions from a gel or from a library. 
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Identification and isolation of orthologous genes from 
closely related species and alleles within a species is 
particularly desirable because of their potential for crop 
improvement. Many important crop traits, such as the solid 
content of tomatoes, result from the combined interactions of 
the products of several genes residing at different loci in 
the genome. Generally, alleles at each of these loci can 
make quantitative differences to the trait. By identifying 
and isolating numerous alleles for each locus from within or 
different species, transgenic plants having various 
combinations of alleles can be created and the effects of the 
combinations measured. Once a more favorable ideal allele 
combination has been identified, crop improvement can be 
accomplished either through biotechnological means or by 
directed conventional breeding programs (Tanksley et al. 
Science 277 : 10 63 (1997) ) . 



The results from hybridizations of the SDFs of the invention 
to Southern blots containing DNA from another species can 
also be used to generate restriction fragment maps for the 
corresponding genomic regions. These maps provide map 

provides additional information about the relative positions 
of restriction sites within fragments, further distinguishing 
mapped DNA from the remainder of the genome. 

Physical maps can be made by digesting genomic DNA with 
different combinations of restriction enzymes. 

Probes for Southern blotting to distinguish individual 
restriction fragments can range in size from 15 to 20 
nucleotides to several thousand nucleotides. More 
preferably, the probe is 100 to 1000 nucleotides long for 
identifying members of a gene family when it is found that 
repetitive sequences would complicate the hybridization. For 
identifying an entire corresponding gene in another species, 
the probe is more preferably the length of the gene, 
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typically 2000 to 10,000 nucleotides, but probes 50-1,000 
nucleotides long might be used. Some genes, however, might 
require probes up to 15,000 nucleotides long or overlapping 
probes constituting the full-length sequence to span their 
lengths . 

Also, while it is preferred that the probe be 
homogeneous with respect to its sequence, that is not 
necessary. For example, as described below, a probe 

representing members of a gene family having diverse 
sequences can be generated using PCR to amplify genomic DNA 
or RNA templates using primers derived from SDFs that include 
sequences that define the gene family. 

For identifying corresponding genes in another species, 
the probe for Southern blotting most preferably would be the 
genomic copy of the probe gene. This allows all elements of 
the gene to be identified in the other species. The next 
most preferable probe is a cDNA spanning the entire coding 
sequence, which allows all of the mRNA-coding portion of the 
gene to be identified; in this case it is possible that some 
introns in the gene might be missed. Probes for Southern 
blotting can easily be generated from SDFs by making primers 
having the sequence at the ends of the SDF and using corn or 
Arabidopsis genomic DNA as a template. In instances where 
the SDF includes sequence conserved among species, primers 
including the conserved sequence can be used for PCR with 
genomic DNA from a species of interest to obtain a probe. 
Similarly, if the SDF includes a domain of interest, that 
portion of the SDF can be used to make primers and, with 
appropriate template DNA, used to make a probe to identify 
genes containing the domain. Alternatively, the PCR products 
can be resolved, for example by gel electrophoresis, and 
cloned and/or sequenced. In this manner, the variants of the 
domain among members of a gene family, both within and across 
species, can be examined. 
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B.4.1 Isolating DNA from Related Organisms 

The SDFs of the invention can be used to isolate the 
corresponding DNA from other organisms. Either cDNA or 
genomic DNA can be isolated. For isolating genomic DNA, a 
lambda, cosmid, BAC or YAC, or other large insert genomic 
library from the plant of interest can be constructed using 
standard molecular biology techniques as described in detail 
by Sambrook et al . 1989 (Molecular Cloning: A Laboratory 
Manual, 2 nd ed. Cold Spring Harbor Laboratory Press, New York) 
and by Ausubel et al . 1992 (Current Protocols in Molecular 
Biology, Greene Publishing, New York) . 

To screen a phage library, recombinant lambda clones are 
plated out on appropriate bacterial medium using an 
appropriate E. coli host strain. The resulting plaques are 
lifted from the plates using nylon or nitrocellulose filters. 
The plaque lifts are processed through denaturation, 
neutralization, and washing treatments following the standard 
protocols outlined by Ausubel et al. (1992). The plaque 
lifts are hybridized to either radioact ively labeled or non- 
radioactive^ labeled SDF DNA at room temperature for about 
16 hours, usually in the presence of 50% formamide and 5X SSC 
(sodium chloride and sodium citrate) buffer and blocking 
reagents. The plaque lifts are then washed at 42°C with 1% 
Sodium Dodecyl Sulfate (SDS) and at a particular 
concentration of SSC. The SSC concentration used is 

dependent upon the stringency at which hybridization occurred 
in the initial Southern blot analysis performed. For 
example, if a fragment hybridized under medium stringency 
(e.g., Tm - 20°C), then this condition is maintained or 
preferably adjusted to a less stringent condition (e.g., Tm- 
30°C) to wash the plaque lifts. Positive clones show 

detectable hybridization e.g., by exposure to X-ray films or 
chromogen formation. The positive clones are then 
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subsequently isolated for purification using the same general 
protocol outlined above. Once the clone is purified, 

restriction analysis can be conducted to narrow the region 
corresponding to the gene of interest. The restriction 
analysis and succeeding subcloning steps can be done using 
procedures described by, for example Sambrook et al. (1989) 
cited above. 

To screen a YAC library, the procedures outlined for the 
lambda library are essentially similar except the YAC clones 
are harbored in bacterial colonies. The YAC clones are 
plated out at reasonable density on nitrocellulose or nylon 
filters supported by appropriate bacterial medium in petri 
plates. Following the growth of the bacterial clones, the 
filters are processed through the denaturat ion, 
neutralization, and washing steps following the procedures of 
Ausubel et al. 1992. The same hybridization procedures for 
lambda library screening are followed. 

To isolate cDNA, similar procedures using appropriately 
modified vectors are employed. For instance, the library can 
be constructed in a lambda vector appropriate for cloning 
cDNA such as Xgtll. Alternatively, the cDNA library can be 
made in a plasmid vector. cDNA for cloning can be prepared 
by any of the methods known in the art, but is preferably 
prepared as described above. Preferably, a cDNA library will 
include a high proportion of full-length clones. 

B . 5 . Isolating and/or Identifying Orthologous Genes 

Probes and primers of the invention can be used to 
identify and/or isolate polynucleotides related to those in SEQ 
TABLES 1 AND 2. Related polynucleotides are those that are 
native to other plant organisms and exhibit either similar 
sequence or encode polypeptides with similar biological 
activity. One specific example is an orthologous gene, a gene 
that has a high degree of sequence similarity, often along the 
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entire length of the coding portion of the gene, and also 
encodes a gene product that performs a similar function in the 
organism. Orthologous genes may be distinguished from 

homologous genes in that homologous genes share sequence 
similarity but often only in a portion of the sequence, which 
often represents a functional domain such as a tyrosine kinase 
activity, a DNA binding domain, or the like. The functional 
activities of homologous genes are not necessarily the same, 
but are the same for orthologous genes. The degree of identity 
is a function of evolutionary separation and, in closely 
related species, the degree of identity can be 98 to 100%. The 
amino acid sequence of a protein encoded by an orthologous gene 
can be less than 75% identical, but tends to be at least75% or 
at least 80% identical, more preferably at least 90%, most 
preferably at least 95% identical to the amino acid sequence of 
the reference protein. 

To find orthologous genes, the probes are hybridized to 
nucleic acids from a species of interest under low stringency 
conditions and blots are then washed under conditions of 
increasing stringency. It is preferable that the wash 

stringency be such that sequences that are 85 to 100% identical 
will hybridize. More preferably, sequences 90 to 100% 

identical will hybridize and most preferably only sequences 
greater than 95% identical will hybridize. The low stringency 
condition is preferably one where sequences containing as much 
as 40-45% mismatches will be able to hybridize. This condition 
is established by T m - 40°C to Tm - 48°C (see below) . One of 
ordinary skill in the art will recognize that, due to 
degeneracy in the genetic code, amino acid sequences that are 
identical can be encoded by DNA sequences as little as 67% 
identical. Thus, it is preferable to make an overlapping 
series of shorter probes, on the order of 24 to 45 nucleotides, 
and individually hybridize them to the same arrayed library to 
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avoid the problem of degeneracy introducing large numbers of 
mismatches . 

As evolutionary divergence increases, genome sequences 
also tend to diverge. Thus, one of skill will recognize that 
searches for orthologous genes between more divergent species 
will require the use of lower stringency conditions compared 
to searches between closely related species. Also, 
degeneracy is more of a problem for searches in the genome of 
a species more distant evolut ionar ily from the species that 
is the source of the SDF probe sequences. 

Therefore the method described in Bouckaert et al., U.S. 
Ser. No. 60/121,700 Atty. Dkt. No. 2750-117P, Client Dkt . No. 
00010.001, filed February 25, 1999, hereby incorporated in 
its entirety by reference, can be applied to the SDFs of the 
present invention to isolate related genes from plant species 
which do not hybridize to the corn or Arabidopsls sequences 
of SEQ TABLES 1 AND 2. 

Identification of the relationship of nucleotide or 
amino acid sequences among plant species can be done by 
comparison of the subject nucleotide or amino acid sequence 
to the sequences of SDFs of the present application presented 
in SEQ TABLES 1 and 2. 

The SDFs of the invention can also be used as probes to 
search for genes that are related to the SDF within a 
species. Such related genes are typically considered to be 
members of a "gene family." In such a case, the sequence 
similarity will often be concentrated into one or a few 
portions of the sequence. The portions of similar sequence 
that define the gene family typically encode a portion of a 
protein or RNA that has an enzymatic or structural function. 
The degree of identity in the amino acid sequence of the 
domain that defines the gene family is preferably at least 
70%, more preferably 80 to 95%, most preferably 85 to 99%. 
To search for members of a gene family within a species, a 
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"low stringency" hybridization is usually performed, but this 
will depend upon the size, distribution and degree of 
sequence divergence of domains that define the gene family. 
SDFs encompassing regulatory regions can be used to identify 
" coordinately expressed" genes by using the regulatory region 
portion of the SDF as a probe. 

In the instances where the SDFs are identified as being 
expressed from genes that confer a particular phenotype, then 
the SDFs can also be used as probes to assay plants of 
different species for those phenotypes. 

I.C. Methods to Inhibit Gene Expression 

In some instances it is desirable to suppress expression 
of an endogenous or exogenous gene. A well-known instance is 
the FLAVOR-SAVOR™ tomato, in which the gene encoding ACC 
synthase is inactivated by an antisense approach, thus 
delaying softening of the fruit after ripening. See for 
example, U.S. Patent No. 5,859,330; U.S. Patent No. 
5,723,766; Oeller, et al, Science, 254:4 37-4 39(1991); and 
Hamilton et al, Nature, 346:284-287 (1990). Also, timing of 
flowering can be controlled by suppression of the FLOWERING 
LOCUS C; high levels of this transcript are associated with 
late flowering, while absence of FLC is associated with early 
flowering (S.D. Michaels et al., Plant Cell 11:949 (1999). 
Also, the transition of apical meristem from production of 
leaves with associated shoots to flowering is regulated by 
TERMINAL FLOWER 1 , APETALA1 and LEAFY. Thus, when it is 
desired to induce a transition from shoot production to 
flowering, it is desirable to suppress TFL1 expression (S.J. 
Liljegren, Plant Cell 1^:1007 (1999)).. As another instance, 
it has been found that suppression of the ethylene forming 
enzyme results in arrested ovule development and female 
sterility that can be reversed by application of ethylene (D. 
De Martinis et al., Plant Cell 11:1061 (1999)). The ability 



WO 00/40695 PCTYUS00/00466 

31 

to manipulate fertility of female plants is useful in the 
increasing fruit production and creating hybrids. 

In an instance when it is desired to express a dominant 
negative mutation, it often helpful to suppress expression of 
the endogenous, native protein. Expression of dominant 
negative mutant proteins is a useful tool for research, for 
example when a dominant negative mutation of a receptor is 
used to constitut ively activate or suppress a signal 
transduction cascade, allowing examination of the phenotype 
and thus the trait (s) controlled by that receptor and 
pathway. 

C . 1 Antisense 

In the case of polynucleotides used to inhibit expression 
of an endogenous gene, the introduced sequence need not be 
perfectly identical to a sequence of the target endogenous 
gene. The introduced polynucleotide sequence will typically be 
at least substantially identical (as determined above) to the 
target endogenous sequence. 

Some polynucleotide SDFs in SEQ TABLES 1 AND 2 represent 
sequences that are expressed in corn and/or Arabidopsis . 
Thus the invention includes using these sequences to generate 
antisense constructs to inhibit transcription and/or 
translation of said SDFs, typically in a plant cell. 

To accomplish this, a polynucleotide segment from the 
desired gene that can hybridize to the mRNA expressed from the 
desired gene (the "antisense segment' 7 ) is operably linked to a 
promoter such that the antisense strand of RNA will be 
transcribed when the construct is present in a host cell. A 
regulated promoter can be used in the construct to control 
transcription of the antisense segment so that transcription 
occurs only under desired circumstances. 

The antisense segment to be introduced generally will be 
substantially identical to at least a portion of the endogenous 
gene or genes to be repressed. The sequence, however, need not 
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be perfectly identical to inhibit expression. Further, the 
antisense product may hybridize to the untranslated region 
instead of or in addition to the coding portion of the gene. 
The vectors of the present invention can be designed such that 
the inhibitory effect applies to other proteins within a family 
of genes exhibiting homology or substantial homology to the 
target gene. 

For antisense suppression, the introduced antisense 
segment sequence also need not be full length relative to 
either the primary transcription product or fully processed 
mRNA. Generally, higher sequence identity can be used to 
compensate for the use of a shorter sequence. Furthermore, 
the introduced sequence need not have the same intron or exon 
pattern, and homology of non-coding segments may be equally 
effective. Normally, a sequence of between about 30 or 40 
nucleotides and the full length of the transcript should be 
used, though a sequence of at least about 100 nucleotides is 
preferred, a sequence of at least about 200 nucleotides is 
more preferred, and a sequence of at least about 500 
nucleotides is especially preferred. 

C . 2 . Ribozymes 

It is also contemplated that gene constructs 
representing ribozymes and based on the SDFs in SEQ TABLES 1 
AND 2 are an object of the invention. Ribozymes can also be 
used to inhibit expression of genes by suppressing the 
translation of the mRNA into a polypeptide. It is possible to 
design ribozymes that specifically pair with virtually any 
target RNA and cleave the phosphodiester backbone at a specific 
location, thereby functionally inactivating the target RNA. In 
carrying out this cleavage, the ribozyme is not itself altered, 
and is thus capable of recycling and cleaving other molecules, 
making it a true enzyme. The inclusion of ribozyme sequences 
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within antisense RNAs confers RNA-cleaving activity upon them, 
thereby increasing the activity of the constructs. 

A number of classes of ribozymes have been identified. 
One class of ribozymes is derived from a number of small 
5 circular RNAs, which are capable of self-cleavage and 
replication in plants. The RNAs replicate either alone 
(viroid RNAs) or with a helper virus (satellite RNAs) . 
Examples include RNAs from avocado sunblotch viroid and the 
satellite RNAs from tobacco ringspot virus, lucerne transient 

10 streak virus, velvet tobacco mottle virus, solanum nodiflorum 
mottle virus and subterranean clover mottle virus. The 
design and use of target RNA-specific ribozymes is described 
in Haselhoff et al . Nature, 334 : 585 (1988). 

Like the antisense constructs above, the ribozyme 

15 sequence portion necessary for pairing need not be identical 
to the target nucleotides to be cleaved, nor identical to the 
sequences in SEQ TABLES 1 AND 2 . Generally, the sequence in 
the ribozyme capable of binding to the target sequence 
exhibits substantial sequence identity to a sequence in SEQ 

20 TABLES 1 AND 2 or the complement thereof, or to a portion of 
said sequence or complement. Further, the ribozyme sequence 
also need not be full length relative to either the primary 
transcription product or fully processed mRNA. The ribozyme 
can be equally effective in inhibiting mRNA translation by 

25 cleaving either in the untranslated or coding regions. 
Generally, higher sequence identity can be used to compensate 
for the use of a shorter sequence. Furthermore, the 
introduced sequence need not have the same intron or exon 
pattern, and homology of non-coding segments may be equally 

30 effective. 



C . 3 . Sense Suppression 

Another method of suppression is by 
exogenous copy of the gene to be suppressed. 



introducing an 
Introduction of 
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expression cassettes in which a nucleic acid is configured in 
the sense orientation with respect to the promoter into the 
chromosome of a plant or by a self -replicating virus has been 
shown to be an effective means by which to induce degradation 
5 of mRNAs of target genes- For an example of the use of this 
method to modulate expression of endogenous genes see, Napoli 
et al., The Plant Cell 2:279 (1990), and U.S. Patents Nos. 
5,034,323, 5,231,020, and 5,283,184. Generally, where 
inhibition of expression is desired, some transcription of the 

10 introduced sequence is probably necessary. The effect may 
occur where the introduced sequence contains no coding sequence 
per se, but comprises only intron or untranslated sequences 
homologous to sequences present in the primary transcript of 
the endogenous sequence. The introduction of only regulatory 

15 promoter sequences can also cause interference with the 
activity of endogenous promoters possessing the same sequence. 
Thus, the described SDFs can also be used to control 
transcription. In all of these procedures, the introduced 
sequence generally will be substantially identical to the 

20 endogenous sequence intended to be inactivated. The minimal 
identity will typically be greater than about 65%, but a higher 
identity might exert a more effective repression of expression 
of the endogenous sequences. Sequence identity of more than 
about 80% is preferred, though about 95% to absolute identity 

25 would be most preferred. As with antisense regulation, the 
effect would likely apply to any other proteins within a 
similar family of genes exhibiting homology or substantial 
homology to the suppressing sequence. 



C . 4 . Other Methods to Inhibit Gene Expression 
30 Yet another means of suppressing gene expression is to 

insert a polynucleotide into the gene of interest to disrupt 
transcription or translation of the gene. 



WO 00/40695 PCT/US00/00466 

35 

Low frequency homologous recombination can be used to 
target a polynucleotide insert to a gene by flanking the 
polynucleotide insert with sequences that are substantially 
similar to the gene to be disrupted. Sequences from SEQ TABLES 
5 1 AND 2, fragments thereof, and substantially similar sequence 
thereto can be used for homologous recombination. 

In addition, random insertion of polynucleotides into a 
host cell genome can also be used to disrupt the gene of 
interest. Azpiroz-Leehan et al . , Trends in Genetics 13 : 152 

10 (1997). In this method, screening for clones from a library 
containing random insertions is preferred to identifying those 
that have polynucleotides inserted into the gene of interest. 
Such screening can be performed using probes and/or primers 
described above based on sequences from SEQ TABLES 1 AND 2, 

15 fragments thereof, and substantially similar sequence thereto. 
The screening can also be performed by selecting clones or R x 
plants having a desired phenotype. 

I . D . Methods of Functional Analysis 

The constructs described in the methods under I.C. above 
20 can be used to determine the function of the polypeptide 
encoded by the gene that is targeted by the constructs. 

Down-regulating the transcription and translation of the 
targeted gene, the host cell or organisms, such as a plant, 
may produce phenotypic changes as compared to a wild-type 
25 cell or organism. In addition, in vitro assays can be used 
to determine if any biological activity, such as calcium 
flux, DNA transcription, nucleotide incorporation, etc., are 
being modulated by the down-regulation of the targeted gene. 

Coordinated regulation of sets of genes, e.g., those 
30 contributing to a desired polygenic trait, is sometimes 
necessary to obtain a desired phenotype. SDFs of the 

invention representing transcription activation and DNA 
binding domains can be assembled into hybrid transcriptional 
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activators. These hybrid transcriptional activators can be 
used with their corresponding DNA elements (i.e., those bound 
by the DNA-binding SDFs) to effect coordinated expression of 
desired genes (J.J. Schwarz et al . , Mol. Cell. Biol. 12:266 
5 (1992), A. Martinez et al . , Mol. Gen. Genet. 261 : 546 (1999)). 

The SDFs of the invention can also be used in the two- 
hybrid genetic systems to identify networks of protein- 
protein interactions (L. McAlister-Henn et al . , Methods 
19:330 (1999), J.C. Hu et al . , Methods 20:80 (2000), M. 

10 Golovkin et al . , J. Biol. Chem. 21 A : 36428 (1999), K. Ichimura 
et al., Biochem. Biophys. Res. Comm. 253:532 (1998)). The 
SDFs of the invention can also be used in various expression 
display methods to identify important protein-DNA 
interactions (e.g. B. Luo et al . , J. Mol. Biol. 266 :479 

15 (1997) ) . 

I.E. Promoters 

The SDFs of the invention are also useful as structural 
or regulatory sequences in a construct for modulating the 
expression of the corresponding gene in a plant or other 

20 organism, e.g. a symbiotic bacterium. For example, promoter 
sequences represented in SEQ TABLES 1 AND 2 can be useful in 
directing expression of coding sequences either as constitutive 
promoters or to direct expression in particular cell types, 
tissues, or organs or in response to environmental stimuli. 

25 The term "promoter" refers to a region ofe^e sequence 

determinants located upstream or downstream from the start of 
transcription and which are involved in recognition and binding 
of RNA polymerase and other proteins to initiate transcription. 
A "plant promoter" is a promoter capable of initiating 

30 transcription in plant cells and can be used to drive 
expression of a translated portion of an SDF. Such promoters 
need not be of plant origin. For example, promoters derived 
from plant viruses, such as the CaMV35S promoter or from 
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Agrobacterium tumefaciens such as the T-DNA promoters, can be 
plant promoters. A typical example of a constitutive promoter 
of plant origin is the promoter of the cowpea trypsin inhibitor 
gene. Typical examples of temporal and/or tissue specific 
5 promoters of plant origin that can be used with the 
polynucleotides of the present invention, are: PTA29, a 
promoter which is capable of driving gene expression 
specifically in tapetum and only during anther development 
(Koltonow et al . , Plant Cell 2:1201 (1990); RCc2 and RCc3, 
10 promoters that direct root-specific gene expression in rice (Xu 
et al., Plant Mol. Biol. 27:237 (1995); TobRB27, a root- 
specific promoter from tobacco (Yamamoto et al., Plant Cell 
3:371 (1991) ) . 

By "specific promoters" is meant promoters that have a 

15 high preference of driving gene expression in the specified 
tissue and/or at the specified time during the concerned tissue 
or organ development. By "high preference" is meant at least 
3-fold, preferably 5-fold, more preferably at least 10-fold 
still more preferably at least 20-fold, 50-fold or 100-fold 

20 increase in expression in the desired tissue over the 
expression in any undesired tissue. 

A typical example of an inducible promoter, which can be 
utilized with the polynucleotides of the present invention, is 
PARSK1, the promoter from the Arabidopsis gene encoding a 

25 serine-threonine kinase enzyme, and which promoter is induced 
by dehydration, abscissic acid and sodium chloride (Wang and 
Goodman, Plant J. 8:37 (1995)). 

With respect to the SDFs of the present invention a 
promoter is likely to be a relatively small portion of a 

30 genomic DNA (gDNA) sequence located in the first 2000 
nucleotides upstream from an initial exon identified in a gDNA 
sequence or initial "ATG" or methionine codon in a 
corresponding cDNA or mRNA sequence. Such promoters are more 
likely to be found in the first 1000 nucleotides upstream of an 
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initial ATG or methionine codon of a cDNA sequence 
corresponding to a gDNA sequence. In particular, the promoter 
is usually located upstream of the transcription start site. 
Such a start site is located at the first exon predicted in the 
5 OCKHAM-cDNA predictions. In such an instance, the transcription 
start site is the first nucleotide of the 5' most exon, if the 
predictions are in the plus (+) strand, or the 3' most if the 
predictions are in the minus (-) strand. Alternative 
transcription start sites may be located between the first 

10 nucleotide of the 5' most exon (or the 3' most exon in the 
minus (-) strand) and the initial ATG or methionine codon in 
the cDNA sequence. The portions of a particular gDNA sequence 
that function as a promoter in a plant cell will preferably be 
found to hybridize at medium or high stringency to gDNA 

15 sequences presented in SEQ TABLES 1 AND 2. 

Promoters are generally modular in nature. Short DNA 
sequences representing binding sites for proteins can be 
separated from each other by intervening sequences of varying 
length. For example, within a particular functional module 

20 protein binding sites may be constituted by regions of 5 to 60, 
preferably 10 to 30, more preferably 10 to 20 nucleotides. 
Within such binding sites, there are typically 2 to 6 
nucleotides that specifically contact amino acids of the 
nucleic acid binding protein. The protein binding sites are 

25 usually separated from each other by 10 to several hundred 
nucleotides, typically by 15 to 150 nucleotides, often by 20 to 
50 nucleotides. DNA binding sites in promoter elements often 
display dyad symmetry in their sequence. Often elements 
binding several different proteins, and/or a plurality of sites 

30 that bind the same protein, will be combined in a region of 100 
to 1000 basepairs. 

Elements that have transcription regulatory function can 
be isolated from their corresponding endogenous gene, or the 
desired sequence can be synthesized, and recombined in 
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constructs to direct expression of a structural gene in a 
desired tissue-specific, temporal-specific or other desired 
manner of inducibility or suppression. When hybridizations are 
performed to identify or isolate elements of a promoter by 
5 hybridization to the long sequences presented in SEQ TABLES 1 
AND 2, conditions should be adjusted to account for the above- 
described nature of promoters. For example short probes , 
constituting the element sought , should be used under low 
temperature and/or high salt conditions. When long probes, 

10 which might include several promoter elements are used, low to 
medium stringency conditions are preferred when hybridizing to 
promoters across species. 

Promoters can consist of a "basal promoter" that 
functions as a site for assembly of a transcription complex 

15 comprising an RNA polymerase, for example RNA polymerase II. A 
typical transcription complex will include additional factors 
such as TFnB, TF n D, and TF n E. Of these, TF n D appears to be 
the only one to bind DNA directly. Basal promoters frequently 
include a "TATA box" element usually located . ^tween 20 and 35 

20 nucleotides upstream from the site of initiation of 
transcription. Basal promoters also sometimes include a "CCAAT 
box" element (typically a sequence CCAAT) and/or a GGGCG 
sequence, usually located between 40 and 200 nucleotides, 
preferably 60 to 120 nucleotides, upstream from the start site 

25 of transcription. 

The promoter might also contain one or more "enhancers" 
and/or "suppressors" that function as binding sites for 
additional transcription factors that have the function of 
modulating the level of transcription with respect to tissue 

30 specificity of transcription, transcriptional responses to 
particular environmental or nutritional factors, and the like. 

If a nucleotide sequence of an SDF, or part of the SDF, 
functions as a promoter or portion of a promoter, then 
nucleotide substitutions, insertions or deletions that do not 
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substantially affect the binding of relevant DNA binding 
proteins would be considered equivalent to the exemplified 
nucleotide sequence. It is envisioned that there are 

instances where it is desirable to decrease the binding of 
5 relevant DNA binding proteins to "silence" or "down-regulate" 
a promoter, or conversely to increase the binding of relevant 
DNA binding proteins to "enhance" or " up-regulate" a 
promoter. In such instances, polynucleotides representing 
changes to the nucleotide sequence of the DNA-protein contact 
10 region by insertion of additional nucleotides, changes to 
identity of relevant nucleotides, including use of 
chemically-modified bases, or deletion of one or more 
nucleotides are considered encompassed by the present 
invention . 

15 Promoter function can be assayed by methods known in the 

art, preferably by measuring activity of a reporter gene 
operatively linked to the sequence being tested for promoter 
function. Examples of reporter genes include those encoding 
luciferase, green fluorescent protein, G/S, neo, cat and bar. 



20 I.F. UTRs and Junctions 

Polynucleotides comprising untranslated (UTR) sequences 
and intron/exon junctions are also within the scope of the 
invention. UTR sequences include introns and 5' or 3' 
untranslated regions ( 5' UTRs or 3' UTRs) . Portions of the 
2 5 sequences shown in SEQ TABLES 1 AND 2 can comprise UTRs and 
introns or intron/exon junctions. 

These portions of SDFs, especially UTRs, can have 
regulatory functions related to, for example, translation rate 
and mRNA stability. Thus, these portions of SDFs can be 
30 isolated for use as elements of gene constructs for expression 
of polynucleotides encoding desired polypeptides. 

Introns of genomic DNA segments might also have 
regulatory functions. Sometimes promoter elements, 
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especially transcription enhancer or suppressor elements, are 
found within introns. Also, elements related to stability of 
heteronuclear RNA and efficiency of transport to the 
cytoplasm for translation can be found in intron elements. 
5 Thus, these segments can also find use as elements of 
expression vectors intended for use to transform plants. 

Just as with promoters, introns and UTR sequences and 
intron/exon junctions can vary from those shown in SEQ TABLES 
1 AND 2. Such changes from those sequences preferably will 
10 not affect the regulatory activity of the UTRs or intron or 
intron/exon junction sequences on expression, transcription, 
or translation. However, in some instances, down-regulation 
of such activity may be desired to modulate traits or 
phenotypic or in vitro activity. 

15 I.G. Coding Sequences 

Isolated polynucleotides of the invention can include 
coding sequences that encode polypeptides comprising an amino 
acid sequence encoded by a sequences in SEQ TABLES 1 AND 2 or 
an amino acid sequence presented in SEQ TABLES 1 AND 2. 

20 A nucleotide sequence "encodes" a polypeptide if a cell 

(or a cell free in vitro system) expressing that nucleotide 
sequence produces a polypeptide having the recited amino acid 
sequence when the nucleotide sequence is transcribed and the 
primary transcript is subsequently processed and translated 

25 by a host cell (or a cell free in vitro system) harboring the 
nucleic acid. Thus, an isolated nucleic acid that "encodes" 
a particular amino acid sequence can be a genomic sequence 
comprising exons and introns or a cDNA sequence that 
represents the product of splicing thereof. An isolated 

30 nucleic acid "encoding an amino acid sequence" also 
encompasses heteronuclear RNA, which contains sequences that 
are spliced out during expression, and mRNA, which lacks 
those sequences. 
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Coding sequences can be constructed using chemical 
synthesis techniques or by isolating coding sequences or by 
modifying such synthesized or isolated coding sequences as 
described above. 

5 In addition to encoding the polypeptide sequences of SEQ 

TABLES 1 AND 2, which are native to corn or Arabidopsis , the 
isolated polynucleotides can be variant polynucleotides that 
encode mutants, fragments, and fusions of those native 
proteins. Such polypeptides are described below in part II. 

10 In variant polynucleotides generally, the number of 

substitutions, deletions or insertions is preferably less than 
20%, more preferably less than 15%; even more preferably less 
than 10%, 5%, 3% or 1% of the number of nucleotides comprising 
a particularly exemplified sequence. It is generally expected 

15 that non-degenerate nucleotide sequence changes that result in 
1 to 10, more preferably 1 to 5 and most preferably 1 to 3 
amino acid insertions, deletions or substitutions will not 
greatly affect the function of an encoded polypeptide. The most 
preferred embodiments are those wherein 1 to 20, preferably 1 

20 to 10, most preferably 1 to 5 nucleotides are added to, deleted 
from and/or substituted in the sequences specifically disclosed 
in SEQ TABLES 1 AND 2. 

Insertions or deletions in polynucleotides intended to be 
used for encoding a polypeptide should preserve the reading 

25 frame. This consideration is not so important in instances 
when the polynucleotide is intended to be used as a 
hybridization probe. 

II. Polypeptides 

Polypeptides within the scope of the invention include 
30 both native proteins as well as mutants, fragments, and 
fusions thereof. Polypeptides of the invention are those 
encoded by any of the six reading frames of sequences shown 
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in SEQ TABLES 1 AND 2, preferably encoded by the three frames 
reading in the 5' to 3' direction of the sequences as shown. 

Native polypeptides include the proteins encoded by the 
sequences shown in SEQ TABLES 1 AND 2. Such native 

5 polypeptides include those encoded by allelic variants. 

Variants, including mutants, will exhibit at least 80% 
sequence identity to those native polypeptides of SEQ TABLES 1 
AND 2. Sequence identity is used for polypeptides as defined 
above for polynucleotides. More preferably, the variants will 

10 exhibit at least 85% sequence identity; even more preferably, 
at least 90% sequence identity; more preferably at least 95%, 
96%, 97%, 98%, or 99% sequence identity. "Fragments" of 
polypeptide or "portions' 7 of polypeptides will exhibit similar 
degrees of identity to the relevant portions of the native 

15 polypeptide. Fusions will exhibit similar degrees of identity 
in that portion of the fusion represented by the variant of the 
native peptide. 

Furthermore, variants will exhibit at least one of the 
functional properties of the native protein. Such properties 

20 include, without limitation, protein interaction, DNA 
interaction, biological activity, immunological activity, 
receptor binding, signal transduction, transcription activity, 
growth factor activity, secondary structure, three-dimensional 
structure, etc. As to properties related to in vitro or in 

25 vivo activities, the variants preferably exhibit at least 60% 
of the activity of the native protein; more preferably at least 
70%, even more preferably at least 80%, 85%, 90% or 95% of at 
least one activity of the native protein. 

A type of mutant of the native polypeptides comprises 

30 amino acid substitutions. "Conservative substitut ions" are 
preferred to maintain the function or activity of the 
polypeptide. Such substitutions include conservation of 

charge, polarity, hydrophobicity , size, etc. For example, one 
or more amino acid residues within the sequence can be 
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substituted with another amino acid of similar polarity that 
acts as a functional equivalent, for example providing a 
hydrogen bond in an enzymatic catalysis. Substitutes for an 
amino acid within an exemplified sequence are preferably made 
5 among the members of the class to which the amino acid belongs. 
For example, the nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, 

10 asparagine, and glutamine. The positively charged (basic) 
amino acids include arginine, lysine and histidine. The 
negatively charged (acidic) amino acids include aspartic acid 
and glutamic acid. 

Within the scope of sequence identity described above, a 

15 polypeptide of the invention may have additional individual 
amino acids or amino acid sequences inserted into the 
polypeptide in the middle thereof and/or at the N-terminal 
and/or C-terminal ends thereof. Likewise, some of the amino 
acids or amino acid sequences may be deleted from the 

20 polypeptide. 

Antibodies 

Isolated polypeptides can be utilized to produce 
antibodies. Polypeptides of the invention can generally be 
used, for example, as antigens for raising antibodies by known 
techniques. The resulting antibodies are useful as reagents 
for determining the distribution of the antigen protein within 
the tissues of a plant or within a cell of a plant. The 
antibodies are also useful for examining the expression level 
of proteins in various tissues, for example in a wild-type 
plant or following genetic manipulation of a plant, by methods 
such as Western blotting. 

Antibodies of the present invention, both polyclonal and 
monoclonal, may be prepared by conventional methods. In 
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general, the polypeptides of the invention are first used to 
immunize a suitable animal, such as a mouse, rat, rabbit, or 
goat. Rabbits and goats are preferred for the preparation of 
polyclonal sera due to the volume of serum obtainable, and the 
5 availability of labeled anti-rabbit and anti-goat antibodies as 
detection reagents. Immunization is generally performed by 
mixing or emulsifying the protein in saline, preferably in an 
adjuvant such as Freund' s complete adjuvant, and injecting the 
mixture or emulsion parenterally (generally subcutaneously or 

10 intramuscularly) . A dose of 50-200 |ug/in j ection is typically 
sufficient. Immunization is generally boosted 2-6 weeks later 
with one or more injections of the protein in saline, 
preferably using Freund' s incomplete adjuvant. One may 

alternatively generate antibodies by in vitro immunization 

15 using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. 

Polyclonal antisera is obtained by bleeding the immunized 
animal into a glass or plastic container, incubating the blood 
at 25°C for one hour, followed by incubating the blood at 4°C 

20 for 2-18 hours. The serum is recovered by centrif ugation 
(e.g., l,000xg for 10 minutes). About 20-50 ml per bleed may 
be obtained from rabbits. 

Monoclonal antibodies are prepared using the method of 
Kohler and Milstein, Nature 256: 495 (1975), or modification 

25 thereof. Typically, a mouse or rat is immunized as described 
above. However, rather than bleeding the animal to extract 
serum, the spleen (and optinally several large lymph nodes) is 
removed and dissociated into single cells. If desired, the 
spleen cells can be screened (after removal of nonspecif ically 

30 adherent cells) by applying a cell suspension to a plate, or 
well, coated with the protein antigen. B-cells expressing 
membrane-bound immunoglobulin specific for the antigen bind to 
the plate, and are not rinsed away with the rest of the 
suspension. Resulting B-cells, or all dissociated spleen 
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cells, are then induced to fuse with myeloma cells to form 
hybridomas, and are cultured in a selective medium (e.g., 
hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are 
5 assayed for the production of antibodies which bind 
specifically to the immunizing antigen (and which do not bind 
to unrelated antigens) . The selected Mab-secreting hybridomas 
are then cultured either in vitro (e.g., in tissue culture 
bottles or hollow fiber reactors) , or in vivo (as ascites in 
10 mice) . 

Other methods for sustaining antibody-producing B-cell 
clones, such as by EBV transformation, are known. 

If desired, the antibodies (whether polyclonal or 
monoclonal) may be labeled using conventional techniques. 

15 Suitable labels include f luorophores, chromophores , radioactive 
atoms (particularly 32 P and 125 I) , electron-dense reagents, 
enzymes, and ligands having specific binding partners. Enzymes 
are typically detected by ther activity. For example, 

horseradish peroxidase is usually detected by its ability to 

20 convert 3, 3' , 5, 5' -tetramethylbenzidine (TNB) to a blue pigment, 
quantifiable with a spectrophotometer. "Specific binding 
partner" refers to a protein capable of binding a ligand 
molcule with high specificity, as for example in the case of an 
antigen and a monoclonal antibody specific therefor. Other 

25 specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor- 
ligand couples known in the art. It should be understood that 
the above description is not meant to categorize the various 
labels into distinct modes. For example, 125 I may serve as a 

30 radioactive label or as an electron-dense reagent. HRP may 
serve as an enzyme or as an antigen for a Mab. Further one may 
combine various labels for desired effect. For example, Mabs 
and avidin also require labels in the practice of this 
invention: thus, one might label a Mab with biotin, and detect 
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its presence with avidin labeled with 125 I, or with an anti- 
biotin Mab labeled with HRP. Other permutations and 

possibilities will be readily apparent to those of ordinary 
skill in the art, and are considered as equivalents within the 
5 scope of the instant invention. 

In Vitro Applications of Polypeptides 

Some polypeptides of the invention will have enzymatic 
activities that are useful in vitro. For example, the soybean 
trypsin inhibitor (Kunitz) family is one of the numerous 

10 families of proteinase inhibitors. It comprises plant 
proteins which have inhibitory activity against serine 
proteinases from the trypsin and subtilisin families, thiol 
proteinases and aspartic proteinases. Thus, these peptides 
find in vitro use in protein purification protocols and 

15 perhaps in therapeutic settings requiring topical application 
of protease inhibitors. 

Delta-aminolevulinic acid dehydratase (EC 4.2.1.24 ) 
(ALAD) catalyzes the second step in the biosynthesis of heme, 
the condensation of two molecules of 5-aminolevulinate to 

20 form porphobilinogen. Thus, ALAD proteins can be used as 
catalysts in synthesis of heme derivatives. Enzymes of 
biosynthetic pathways generally can be used as catalysts for 
in vitro synthesis of the compounds representing products of 
the pathway. 

25 Polypeptides encoded by SDFs of the invention can be 

engineered to provide purification reagents to identify and 
purify additional polypeptides that bind to them. This 
allows one to identify proteins that function as multimers or 
elucidate signal transduction or metabolic pathways. In the 

30 case of DNA binding proteins, the polypeptide can be used in 
a similar manner to identify the DNA determinants of specific 
binding (S. Pierrou et al., Anal. Biochem. 229 : 99 (1995), S. 
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Chusacultanachai et al., J. Biol. Chem. 274:23591 (1999), Q. 
Lin et al., J. Biol. Chem. 212 :21214 (1997)). 

II. A. MUTANTS, FRAGMENTS, AND FUSIONS 

Generally, mutants, fragments, or fusions of the 
polypeptides encoded by the maximum length seuqence (MLS ) can 
exhibit at least one of the activities of the identified 
domains and/or related polypeptides described in Sections (C) 
and (D) of REF TABLES 1 and 2 corresponding to the MLS of 
interest . 

II .A. (1) Mutants 

A type of mutant of the native polypeptides 
comprises amino acid substitutions- "Conservative 
substitutions" , described above (see II.), are preferred to 
maintain the function or activity of the polypeptide, 
polypeptide. Such substitutions include conservation of 
charge, polarity, hydrophobicity, size, etc. For example, one 
or more amino acid residues within the sequence can be 
substituted with another amino acid of similar polarity that 
acts as a functional equivalent, for example providing a 
hydrogen bond in an enzymatic catalysis. Substitutes for an 
amino acid within an exemplified sequence are preferably made 
among the members of the class to which the amino acid belongs. 
For example, the nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, 
asparagine, and glutamine. The positively charged (basic) 
amino acids include arginine, lysine and histidine. The 
negatively charged (acidic) amino acids include aspartic acid 
and glutamic acid. 

Within the scope of sequence identity described above, a 
polypeptide of the invention may have additional individual 
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amino acids or amino acid sequences inserted into the 
polypeptide in the middle thereof and/or at the N-terminal 
and/or C-terminal ends thereof. Likewise, some of the amino 
acids or amino acid sequences may be deleted from the 
polypeptide. Amino acid substitutions may also be made in the 
sequences; conservative substitutions being preferred. 

One preferred class of mutants are those that 
comprise (1) the domain of a MLS encoded polypeptide and/or 
(2) residues conserved between the MLS encoded polypeptide 
and related polypeptides of the MLS. For this class of 
mutants, the MLS encoded polypeptide sequence is changed by 
insertion, deletion, or substitution at positions flanking 
the domain and/or conserved residues. 

Another class of mutants includes those that 
comprise a MLS encoded polypeptide sequence that is changed 
in the domain or conserved residues by a conservative 
substitution . 

Yet another class of mutants includes those that 
lack one of the in vitro activities, or structural features 
of the MLS encoded polypeptides. One example is dominant 
negative mutants. Such a mutant may comprise an MLS encoded 
polypeptide sequence with non-conservative changes in a 
particular domain or group of conserved residues. 

II .A. (2) FRAGMENTS 

Fragments of particular interest are those that 
comprise a domain identified for a polypeptide encoded by an 
MLS of the instant invention and mutants thereof. Also, 
fragments that comprise at least one region of residues 
conserved between an MLS encoded polypeptide and its related 
polypeptides are of great interest. Fragments are sometimes 
useful as dominant negative mutations. 



II. A. (3) FUSIONS 
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Of interest are chimeras comprising (1) a fragment 
of the MLS encoded polypeptide or mutants thereof of interest 
and (2) a fragment of a polypeptide comprising the same 
domain. For example, an AP2 helix encoded by a MLS of the 
invention fused to second AP2 helix from ANT protein, which 
comprises two AP2 helices. The present invention also 
encompasses fusions of MLS encoded polypeptides, mutants, or 
fragments thereof fused with related proteins or fragments 
thereof . 

DEFINITION OF DOMAINS 

The polypeptides of the invention may possess 
identifying domains as shown in REF TABLES 1 and 2. Domains 
are fingerprints or signatures that can be used to 
characterize protein families and/or motifs. Such 
fingerprints or signatures can comprise conserved (1) primary 
sequence, (2) secondary structure, and/or (3) three- 
dimensional conformation. Generally, each domain has been 
associated with either a family of proteins or a motif. 
Typically, these families and/or motifs have been correlated 
with specific in-vitro and/or in-vivo activities. A domain 
can be any length, including the entirety of the sequence of 
a protein. Detailed descriptions of the domains, associated 
families and motifs, and correlated activities of the 
polypeptides of the instant invention are described below. 
Usually, the polypeptides with designated domain (s) can 
exhibit at least one activity that is exhibited by any 
polypeptide that comprises the same domain (s). 

Specific domains within the MLS encoded polypeptides are 
indicated by the reference REF TABLES 1 and 2. In addition, 
the domains within the MLS encoded polypeptide can be defined 
by the region that exhibits at least 70% sequence identity 
with the consensus sequences listed in the detailed 
description below of each of the domains. 
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The majority of the protein domain descriptions given 
below are obtained from Prosite, 
(http/ /www . expasy . ch/prosite/ ) , and Pfam, 
(http//pf am. wustl . edu /browse . shtml ) . 
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1. (AAA) AAA-protein family signature 

A large family of ATPases has been described [1 to 5] whose 
key feature is that they share a conserved region of about 
220 amino acids that contains anATP-binding site. This family 
is now called AAA, for 'A'TPases ' A' ssociated with diverse 
cellular 1 A 1 ct ivities . The proteins that belong to this 
family either contain one or two AAA domains. Proteins 
containing two AAA domains: 

- Mammalian and drosophila NSF (N-ethylmaleimide-sensitive 
fusion protein) and the fungal homolog, SEC18. These 
proteins are involved in intracellular transport between 
the endoplasmic reticulum and Golgi, as well as between 
different Golgi cisternae. 

- Mammalian transitional endoplasmic reticulum ATPase 
(previously known as p97 or VCP) which is involved in the 
transfer of membranes from the endoplasmic reticulum to the 
golgi apparatus. This protein forms a ring-shaped 
homooligomer composed of six subunits. The yeast homolog is 
CDC48 and it may play a role in spindle pole proliferation. 

- Yeast protein PAS1, essential for peroxisome assembly and 
the related protein PAS1 from Pichia pastoris. 

- Yeast protein AFG2 . 

- Sulfolobus acidocaldarius protein SAV and Halobacterium 
salinarium cdcH which may be part of a transduction pathway 
connecting light to cell division. 

Proteins containing a single AAA domain: 

- Escherichia coli and other bacteria ftsH (or hflB) protein. 
FtsH is an ATP-dependent zinc metallopeptidase that seems 
to degrade the heat-shock sigma-32 factor. 

It is an integral membrane protein with a large cytoplasmic 
C-terminal domain that contain both the AAA and the protease 
domains . 
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- Yeast protein YME1, a protein important for maintaining the 
integrity of the mitochondrial compartment. YME1 is also a 
zinc-dependent protease . 

- Yeast protein AFG3 (or YTA10) . This protein also seems to 
contain a AAA domain followed by a zinc-dependent protease 
domain . 

Subunits from the regulatory complex of the 26S 
proteasome [6] which is involved in the ATP-dependent 
degradation of ubiquitinated proteins: 

a) Mammalian subunit 4 and homologs in other higher 
eukaryotes, in yeast (gene YTA5) and fission yeast 
(gene mts2 ) . 

b) Mammalian subunit 6 (TBP7) and homologs in other 
higher eukaryotes and in yeast (gene YTA2 ) . 

c) Mammalian subunit 7 (MSS1) and homologs in other 
higher eukaryotes and in yeast (gene CIM5 or YTA3) . 

d) Mammalian subunit 8 (P45) and homologs in other 
higher eukaryotes and in yeast (SUG1 or CIM3 or TBYl) 
and fission yeast (gene letl) . 

Other probable subunits such as human TBP1 which seems 
to influences HIV gene expression by interacting with the 
virus tat transactivator protein and yeast YTA1 and YTA6 . 

Yeast protein BCS1, a mitochondrial protein essential 
for the expression of the Rieske iron-sulfur protein. 
Yeast protein MSP1, a protein involved in 
intramitochondrial sorting of proteins. 
Yeast protein PAS 8 , and the corresponding proteins 
PAS 5 from Pichia pastoris and PAY 4 from Yarrowia 
lipolytica . 

Mouse protein SKD1 and its fission yeast homolog 
(SpAC2Gll .06) . 

Caenorhabditis elegans meiotic spindle formation 
protein mei-1. 
Yeast protein SAP1. 
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Yeast protein YTA7 . 

Mycobacterium leprae hypothetical protein A2126A. 
It is proposed that, in general, the AAA domains in 
these proteins act as ATP- dependent protein clamps [5] . In 
addition to the ATP-binding f A ! and ' B 1 motifs, which are 
located in the N-terminal half of this domain, there is a 
highly conserved region located in the central part of the 
domain which was used to develop a signature pattern. 

Consensus pattern: [LIVMT] -x- [LIVMT] - [LIVMF] -x- [GATMC] - [ST] - 
[NS] -x (4) - [LIVM] - D-x-A- [LI FA] -x-R 

[1] Froehlich K.-U., Fries H.W., Ruediger M . , Erdmann R. , 

Botstein D. , Mecke D. J . Cell Biol. 114:443-453(1991). 

[2] Erdmann R., Wiebel F.F., Flessau A., Rytka J., Beyer A., 

Froehlich K.-U., Kunau W.-H. Cell 64:4 99-510(1991). 

[3] Peters J.-M., Walsh M.J., Franke W.W. EMBO J. 9:1757- 

1767 (1990) . 

[4] Kunau W.-H., Beyer A., Goette K. , Marzioch M . , Saidowsky 
J., Skaletz-Rorowski A., Wiebel F.F. Biochimie 75:209- 
224 (1993) . 

[5] Confalonieri F . , Duguet M. BioEssays 17 : 639-650 (1995) . [ 
6] Hilt W., Wolf D.H. Trends Biochem. Sci. 21:96-102(1996). 

2. Aminotransferases class-IV signature 

Aminotransferases share certain mechanistic features 
with other pyridoxal-phosphate dependent enzymes, such as the 
covalent binding of the pyridoxal-phosphate group to a lysine 
residue. On the basis of sequence similarity, these various 
enzymes can be grouped [1,2] into subfamilies. One of these, 
called class-IV, currently consists of the following enzymes: 
Branched-chain amino-acid aminotransferase (EC 
2.6.1.42 ) (transaminase B) , a bacterial (gene ilvE) 
and eukaryotic enzyme which catalyzes the reversible 
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transfer of an amino group from 4-methyl-2- 
oxopentanoate to glutamate, to form leucine and 2- 
oxoglutarate . 

D-alanine aminotransferase (EC 2,6-1.21 ) . A bacterial 
enzyme which catalyzes the transfer of the amino group 
from D-alanine (and other D-amino acids) to 2- 
oxoglutarate, to form pyruvate and D-aspartate. 
4-amino-4-deoxychorismate (ADC) lyase (gene pabC) . A 
bacterial enzyme that converts ADC into 4- 
aminobenzoate (PABA) and pyruvate. 
The above enzymes are proteins of about 270 to 415 
amino-acid residues that share a few regions of sequence 
similarity. Surprisingly, the best-conserved region does not 
include the lysine residue to which the pyridoxal- 
phosphategroup is known to be attached, in ilvE. The region 
that has been selected as a signature pattern is located some 
40 residues at the C-terminus side of the PIP-lysine 

Consensus pattern: E-x- [STAGCI] -x (2) -N- [LIVMFAC] - [FY] - 
x (6, 12) - [LIVMF] -x-T- x (6, 8) - [LIVM] -x- [GS] - [LIVM] -x- [KR] - 

[1] Green J.M., Merkel W.K., Nichols B.P. J. Bacteriol. 
174 : 5317-5323 (1992) . 

[2] Bairoch A. Unpublished observations (1992) . 
3. Bacterial mutT domain signature 

The bacterial mutT protein is involved in the GO system [1] 
responsible for removing an oxidatively damaged form of 
guanine ( 8 -hydroxyguanine or7, 8-dihydro-8-oxoguanine) from 
DNA and the nucleotide pool. 8-oxo-dGTP is inserted opposite 
to dA and dC residues of template DNA with almost equal 
efficiency thus leading to A.T to G.C transversions . MutT 
specifically degrades 8-oxo-dGTP to the monophosphate with 
the concomitant release of pyrophosphate. MutT is a small 



WO 00/40695 PCT/US00/00466 

56 

protein of about 12 to 15 Kd. It has been shown [2,3] that a 
region of about 40 amino acid residues, which is found in the 
N-terminal part of mutT, can also be found in a variety of 
other prokaryotic, viral, and eukaryotic proteins. These 
proteins are: 

Streptomyces pneumoniae mutX. 

A mutT homolog from plasmid pSAM2 of Streptomyces 
ambof aciens . 

Bartonella bacillif ormis invasion protein A (gene 
invA) . 

Escherichia coli dATP pyrophosphohydrolase . 
Protein D250 from African swine fever viruses. 
Proteins D9 and D10 from a variety of poxviruses. 
Mammalian 7 , 8-dihydro-8 -oxoguanine triphosphatase (EC 
3.1.6.-) [4] . 

Mammalian di adenosine 5 1 , 5 1 1 f -PI , P4-tetraphosphate 
asymmetrical hydrolase (Ap4Aase) (EC 3.6.1.17 ) [5], 
which cleaves A-5 1 -PPPP-5 ■ A to yield AMP and ATP. 
A protein encoded on the antisense RNA of the basic 
fibroblast growth factor gene in higher vertebrates. 
Yeast protein YSA1. 

Escherichia coli hypothetical protein yfaO. 

Escherichia coli hypothetical protein ygdU and HI0901 

the corresponding Haemophilus influenzae protein. 

Escherichia coli hypothetical protein yjaD and HI0432 

the corresponding Haemophilus influenzae protein. 

Escherichia coli hypothetical protein yrfE. 

Bacillus subtilis hypothetical protein yqkG. 

Bacillus subtilis hypothetical protein yzgD. 

Yeast hypothetical protein YGL067w. 
It is proposed [2] that the conserved domain could be 
involved in the active center of a family of pyrophosphate- 
releasing NTPases . As a signature pattern the core region of 
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the domain was selected; it contains four conserved glutamate 
residues . 

Consensus pattern: G-x (5) -E-x (4) - [STAGC] - [LIVMAC] -x-R-E- 
[LIVMFT] -x-E-E- 

[1] Michaels M.L., Miller J.H. J. Bacterid. 174:6321- 
6325 (1992) . 

[2] Koonin E.V. Nucleic Acids Res. 21:4847-4847(1993). 

[3] Mejean V., Salles C, Bullions M.J., Bessman M.J., 
Claverys J. -P. Mol . Microbiol. 11:323-330(1994). 

[4] Sakumi K., Furuichi M., Tsuzuki T . , Kakuma T . , Kawabata 
S., Maki H., Sekiguchi M. J. Biol. Chem. 268:23524- 
23530 (1993) . 

[5] Thorne N.M.H., Hankin S., Wilkinson M.C., Nunez C, 
Barraclough R. , McLennan A.G. Biochem. J. 311:717-721(1995). 

4 . Cystatin domain 

This is a very diverse family. Attempts to define separate 
subfamilies have failed. Typically, either the N-terminal or 
C-terminal end is very divergent. But splitting into two 
domains would make very short families. Cathelicidins are 
related to this family but have not been included. Number of 
members: 147 

Inhibitors of cysteine proteases [1,2,3], which are 
found in the tissues and body fluids of animals, in the larva 
of the worm Onchocerca volvulus [4], as well as in plants, 
can be grouped into three distinct but related families: 

Type 1 cystatins (or stefins) , molecules of about 100 
amino acid residues with neither disulfide bonds nor 
carbohydrate groups . 

Type 2 cystatins, molecules of about 115 amino acid 
residues which contain one or two disulfide loops near 
their C-terminus. 
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Kininogens, which are multifunctional plasma 
glycoproteins . 

They are the precursor of the active peptide bradykinin 
and play a role in blood coagulation by helping to position 
5 optimally prekallikrein and factor XI next to factor XII. 

They are also inhibitors of cysteine proteases. Structurally, 
kininogens are made of three contiguous type-2 cystatin 
domains, followed by an additional domain (of variable 
length) which contains the sequence of bradykinin. The first 
10 of the three cystatin domains seems to have lost its 
inhibitory activity. 

In all these inhibitors, there is a conserved region of 
five residues which has been proposed to be important for the 
binding to the cysteine proteases. The consensus pattern 
15 starts one residue before this conserved region. 



-Consensus pattern: [ GSTEQKRV] -Q- [LIVT] - [VAF] - [SAGQ] -G-x- 
[LIVMNK] -x (2) - [LIVMFY] -x- [LIVMFYA] - [DENQKRHSIV] 

20 [1] Barrett A.J. Trends Biochem. Sci. 12:193-196(1987). 

[2] Rawlings N.D., Barrett A.J. J. Mol . Evol . 30:60-71(1990). 
[3] Turk V., Bode W. FEBS Lett. 285:213-219(1991). 
[4] Lustigman S., Brotman B., Huima T., Prince A.M. Mol. 
Biochem. Parasitol. 45:65-7 6(1991) . 

25 

5. Dehydrins signatures 

A number of proteins are produced by plants that experience 
water-stress. Water-stress takes place when the water 
available to a plant falls below a critical level. The plant 
30 hormone abscisic acid (ABA) appears to modulate the response 
of plant to water-stress. Proteins that are expressed during 
water-stress are called dehydrins [1,2] or LEA group 2 
proteins [3]. The proteins that belong to this family are 
listed below. 
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- Arabidopsis thaliana XERO 1, XERO 2 (LTI30), RAB18, 
ERD10 (LTI45) ERD14 and COR47. 

- Barley dehydrins B8, B9, B17, and B18. 

- Cotton LEA protein D-ll. 

Craterostigma plantagineum dessication-related 
proteins A and B. 

- Maize dehydrin M3 (RAB-17) . 

- Pea dehydrins DHN1, DHN2, and DHN3. 
Radish LEA protein. 

- Rice proteins RAB 16B, 16C, 16D, RAB21, and RAB25. 
Tomato TAS14. 

Wheat dehydrin RAB 15 and cold-shock protein cor410, 

cs66 and csl20 . 

Dehydrins share a number of structural features. 
One of the most notable features is the presence, in 
their central region, of a continuous run of five to 
nine serines followed by a cluster of charged residues. 
Such a region has been found in all known dehydrins so 
far with the exception of pea dehydrins. A second 
conserved feature is the presence of two copies of 
alysine-rich octapeptide; the first copy is located just 
after the cluster of charged residues that follows the 
poly-serine region and the second copy is found at the 
C-terminal extremity. Signature patterns for both 
regions were derived. 

Consensus pattern: S (5) - [DE] -x- [DE] -G-x (1, 2) -G-x (0, 1) - [KR] (4 
Consensus pattern: [KR] - [LIM] -K- [DE] -K- [LIM] -P-G- 

[1] Close T.J., Kortt A. A. , Chandler P.M. Plant Mol. Biol. 
13: 95-108 (1989) . 

[2] Robertson M . , Chandler P.M. Plant Mol. Biol. 19:1031- 
1044 (1992) . 
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[3] Dure L. Ill, Crouch M. , Harada J., Ho T.-H. D., Mundy J., 
Quatrano R. , Thomas T . , Sung Z.R. Plant Mol . Biol. 12:475- 
486 (1989) . 

5 6. D-isomer specific 2 -hydroxyacid dehydrogenases (2 Hacid 
DH) 

This Pfam covers the Formate dehydrogenase, D-glycerate 
dehydrogenase and D-lactate dehydrogenase families in SCOP. A 
number of NAD-dependent 2-hydroxyacid dehydrogenases which 
10 seem to be specific for the D-isomer of their substrate have 
been shown [1,2,3,4] to be functionally and structurally 
related. These enzymes are listed below. 

D-lactate dehydrogenase (EC 1.1.1.28), a bacterial 
enzyme which catalyzes the reduction of D-lactate to 
15 pyruvate. 

D-glycerate dehydrogenase (EC 1.1.1.29) (NADH- 
dependent hydroxypyruvate reductase) , a plant leaf 
peroxisomal enzyme that catalyzes the reduction of 
hydroxypyruvate to glycerate. This reaction is part 
20 of the glycolate pathway of photorespiration . 

D-glycerate dehydrogenase from the bacteria 
Hyphomicrobium methylovorum and Methylobacterium 
extorquens . 

3-phosphoglycerate dehydrogenase (EC 1.1.1.95), a 
25 bacterial enzyme that catalyzes the oxidation of D-3- 

phosphoglycerate to 3-phosphohydroxypyruvate . This 
reaction is the first committed step in the 
1 phosphorylated 1 pathway of serine biosynthesis. 
Erythronate-4 -phosphate dehydrogenase (EC 1.1.1.-) 
30 (gene pdxB) , a bacterial enzyme, involved in the 

biosynthesis of pyridoxine (vitamin B6) . 
D-2-hydroxyisocaproate dehydrogenase (EC 1.1.1.-) (D- 
hicDH) , a bacterial enzyme that catalyzes the 
reversible and stereospecif ic interconversion between 
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2-ketocarboxylic acids and D-2-hydroxy-carboxylic 
acids • 

Formate dehydrogenase (EC 1.2.1.2) (FDH) from the 

bacteria Pseudomonas sp . 101 and various fungi [5]. 
5 - Vancomycin resistance protein vanH from Enterococcus 

faecium; this protein is a D-specific alpha-keto acid 

dehydrogenase involved in the formation of a 

peptidoglycan which does not terminate by D-alanine 

thus preventing vancomycin binding. 
10 - Escherichia coli hypothetical protein ycdW. 

Escherichia coli hypothetical protein yiaE. 

Haemophilus influenzae hypothetical protein HI1556. 

Yeast hypothetical protein YER081w. 

Yeast hypothetical protein YIL074w. 
15 All these enzymes have similar enzymatic activities and 

are structurally related. Three of the most conserved regions 
of these proteins have been selected to develop patterns. The 
first pattern is based on a glycine-r ich region located in 
the central section of these enzymes; this region probably 
20 corresponds to the NAD-binding domain. The two other patterns 
contain a number of conserved charged residues, some of which 
may play a role in the catalytic mechanism. 

-Consensus pattern: [LIVMA] - [AG] - [IVT] - [LIVMFY] - [AG] -x-G- 
25 [NHKRQGSAC] - [L1V] -G-x (13, 14 ) - [LIVfMT] -x (2) - [FYwCTH] - [DNSTK] 
-Consensus pattern: [LIVMFYWA] - [LIVFYWC] -x (2) - [SAC] - [DNQHR] - 
[IVFA] - [LIVF] -x- [LIVF] - [HNI] -x-P-x (4 ) - [STN] -x (2) - [LIVMF] -x- 
[GSDN] 

-Consensus pattern: [LMFATC] - [KPQ] -x- [GSTDN] -x- [LIVMFYWR] - 
30 [LIVMFYW] (2) -N-x- [STAGC] -R- [GP] -x- [LIVH] - [LIVMC] - [DNV] 

[I] Grant G.A. Biochem. Biophys . Res. Coitimun. 165:1371- 
1374 (1989) . 
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[2] Kochhar S., Hunziker P., Leong-Morgenthaler P.M., 
Hottinger H. Biochem. Biophys . Res. Commun. 184:60-66(1992). 
[3] Ohta T., Taguchi H. J. Biol. Chem. 266:12588-12594(1991). 
[4] Goldberg J.D., Yoshida T . , Brick P. J. Mol. Biol. 
5 236: 1123-1140 (1994) . 

[5] Popov V.O., Lamzin V.S. Biochem. J. 301:625-643(1994). 

7. dnaJ domains signatures and profile 

The prokaryotic heat shock protein dnaJ interacts with 

10 the chaperone hsp70-like dnaK protein [1]. Structurally, the 
dnaJ protein consists of an N- terminal conserved domain 
(called f J f domain) of about 70 amino acids, a glycine-rich 
region ( ! G f domain') of about 30 residues, a central domain 
containing four repeats of a CXXCXGXG motif ('CRR' domain) 

15 and a C-terminal region of 120 to 170 residues. Such a 

structure is shown in the following schematic representation: 

+ | N-terminal I I Gly-R I I CXXCXGXG | C- 

terminal | + + -+ + + + 

20 + 

It has been shown [2] that the f J f domain as well as the 
f CRR' domain are also found in other prokaryotic and 
eukaryotic proteins which are listed below. 

a) Proteins containing both a 'J' and a f CRR' domain: 

25 - Yeast protein MAS5/YDJ1 which seems to be involved in 

mitochondrial protein import. 

Yeast protein MDJ1, involved in mitochondrial 
biogenesis and protein folding. 

Yeast protein SCJ1, involved in protein sorting. 
30 - Yeast protein XD Jl . 

Plants dnaJ homologs (from leek and cucumber) . 
Human HDJ2, a dnaJ homolog of unknown function. 
Yeast hypothetical protein YNL077w. 

b) Proteins containing a ! J' domain without a f CRR* domain: 
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Rhizobium fredii nolC, a protein involved in cultivar- 
specific nodulation of soybean. 

Escherichia coli cbpA [3], a protein that binds curved 
DNA. 

5 - Yeast protein SEC63/NPL1, important for protein 

assembly into the endoplasmic reticulum and the 
nucleus . 

Yeast protein SIS1, required for nuclear migration 

during mitosis. 
10 - Yeast protein CAJ1 . 

Yeast hypothetical protein YFR041c. 

Yeast hypothetical protein YIR004w. 

Yeast hypothetical protein YJLl62c. 

Plasmodium falciparum ring-infected erythrocyte 
15 surface antigen (RESA) . RESA, whose function is not 

known, is associated with the membrane skeleton of 

newly invaded erythrocytes. 

Human HDJ1. 

Human HSJ1, a neuronal protein. 
20 - Drosophila cysteine-string protein (csp) . 

A signature pattern for the 1 J 1 domain was developed, 
based on conserved positions in the C-terminal half of this 
domain. A pattern for the 1 CRR 1 domain, based on the first 
two copies of that motif was also developed. A profile for 
25 the 1 J f domain was also developed. 

Consensus pattern: [FY] -x (2) - [LIVMA] -x (3) - [ FYWHIMT] - [DENQSA] - 
x-L-x- [ DN] -x (3) - [KR] -x (2) - [FYI] - 

Consensus pattern: C- [DEGSTHKR] -x-C-x-G-x- [GK] - [AGS DM] -x (2) - 
30 [GSNKR] -x (4, 6) -C- x ( 2 , 3 ) -C-x-G-x-G- 



[1] Cyr D.M., Langer T . , Douglas M.G. Trends Biochem. Sci. 
19: 176-181 (1994) . 
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[2] Bork P., Sander C, Valencia A., Bukau B. Trends Biochem. 
Sci. 17: 129-129 (1992) . 

[3] Ueguchi C, Kaneda M. , Yamada H., Mizuno T. Proc. Natl. 
Acad. Sci. U.S.A. 91: 10 54-1058 (19 94). 

5 

8. Domain of unknown function 

9. Gamma-thionins family signature 

The following small plant proteins are evolutionary 

10 related: 

Gamma-thionins from wheat endosperm (gamma- 
purothionins ) and barley (gamma- hordothionins ) which 
are toxic to animal cells and inhibit protein 
synthesis in cell free systems [1] . 

15 A flower-specific thionin (FST) from tobacco [2]. 

Antifungal proteins (AFP) from the seeds of 
Brassicaceae species such as radish, mustard, turnip 
and Arabidopsis thaliana [3] . 

Inhibitors of insect alpha-amylases from sorghum [4], 
20 - Probable protease inhibitor P322 from potato. 

A germination-related protein from cowpea [5] . 
Anther-specific protein SF18 from sunflower [6]. SF18 
is a protein that contains a gamma-thionin domain at 
its N-terminus and a proline-rich C- terminal domain. 
25 - Soybean sulfur-rich protein SE60 [7] . 

Vicia faba antibacterial peptides fabatin-1 and -2. 
In their mature form, these proteins generally consist 
of about 45 to 50amino-acid residues. As shown in the 
following schematic representation, these peptides contain 
30 eight conserved cysteines involved in disulfide bonds. 

+ _ + | + 

+ I I I I I 

xxCxxxxxxxxxxCxxxxxCxxxCxxxxxxxxxCxxxxxxCxCxxxC 

*******************!***! I I _j I l_ I _J 

35 + 
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'C : conserved cysteine involved in a disulfide bond. 
1 * 1 : position of the pattern. 

Consensus pattern: [KRG] -x-C-x (3) - [SV] -x (2) - [FYWH] -x- [GF] -x- 
5 C-x (5) -C-x (3) -C [The four C's are involved in disulfide 
bonds ] - 

[1] Bruix M., Jimenez M.A., Santoro J., Gonzalez C, Colilla 
F.J., Mendez E . , Rico M. Biochemistry 32:715-724(1993). 
10 [2] Gu Q . , Kawata E.E., Morse M.-J., Wu H.-M., Cheung A.Y. 
Mol. Gen. Genet. 234:89-96(1992). 

[3] Terras F.R.G., Torrekens S., van Leuven F . , Osborn R.W., 
Vanderleyden J. , Cammue B.P.A., Broekaert W.F. FEBS Lett. 
316:233-240 (1993) . 
15 [4] Bloch C. Jr., Richardson M. FEBS Lett. 279:101-104(1991). 
[5] Ishibashi N., Yamauchi D., Miniamikawa T. Plant Mol. 
Biol. 15:59-64 (1990) . 

[7] Choi Y . , Choi Y.D., Lee J.S. Plant Physiol. 101:699- 
700 (1993) . 

20 

10. haloacid dehalogenase-li ke hydrolase 

This family is structurally different from the alpha/ 
beta hydrolase family ( abhydrolase ) . This family includes L- 
2-haloacid dehalogenase, epoxide hydrolases and phosphatases. 

25 The structure of the family consists of two domains. One is 
an inserted four helix bundle, which is the least well 
conserved region of the alignment, between residues 16 and 96 
of Swiss : P24069 . The rest of the fold is composed of the core 
alpha/beta domain . 

30 [1] Hisano T, Hata Y, Fujii T, Liu JQ, Kurihara T, Esaki N, 
Soda K, J Biol Chem 1996; 271:20322-20330. 



11. Helix-turn-helix (HTH3) 
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This large family of DNA binding helix-turn helix 
proteins includes Cro Swiss : P03036 and CI Swiss : P03034 . 

12. Heme-binding domain in cytochrome b5 and oxidoreductases 
5 (heme_l) 

Cytochrome b5 is a membrane-bound hemo protein which 
acts as an electron carrier for several membrane-bound 
oxygenases [1]. There are two homologous forms of b5, one 
found in microsomes and one found in the outer membrane of 

10 mitochondria. Two conserved histidine residues serve as axial 
ligands for the heme group. The structure of a number of 
oxidoreductases consists of the juxtaposition of a heme- 
binding domain homologous to that of b5 and either a 
f lavodehydrogenase or a molybdopterin domain. These enzymes 

15 are: 

Lactate dehydrogenase (EC 1.1.2.3 ) [2], an enzyme that 
consists of a f lavodehydrogenase domain and a heme- 
binding domain called cytochrome b2 . 
Nitrate reductase (EC 1.6.6.1 ) , a key enzyme involved 
20 in the first step of nitrate assimilation in plants, 

fungi and bacteria [3,4]. Consists of a molybdopterin 
domain (see < PDOC00484 >) , a heme-binding domain called 
cytochrome b557, as well as a cytochrome reductase 
domain . 

25 - Sulfite oxidase (EC 1.8.3.1 ) [5], which catalyzes the 

terminal reaction in the oxidative degradation of 
sulfur-containing amino acids. Also consists of a 
molybdopterin domain and a heme-binding domain. 
This family of proteins also includes: 

30 - TU-36B, a Drosophila muscle protein of unknown 

function [ 6 ] . 

Fission yeast hypothetical protein SpACl F12 . 1 0c . 
Yeast hypothetical protein YMR073c. 
Yeast hypothetical protein YMR272c. 
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A segment was used which includes the first of the two 
histidine heme ligands, as a signature pattern for the heme- 
binding domain of cytochrome b5 family. 

5 Consensus pattern: [FY] - [LIVMK] -x (2 ) -H-P- [GA] -G [H is a heme 
axial ligand] - 

[1] Ozols J. Biochim. Biophys . Acta 997:121-130(1989). 

[2] Guiard B. EMBO J. 4:3265-3272(1985). 
10 [3] Calza R., Huttner E., Vincentz M., Rouze P., Galangau F. , 
Vaucheret H., Cherel I., Meyer C, Kronenberger J., Caboche 
M. Mol. Gen. Genet. 209:552-562(1987). 

[4] Crawford N.M., Smith M., Bellissimo D. , Davis R.W. Proc. 
Natl. Acad. Sci. U.S.A. 85:5006-5010(1988). 
15 [5] Guiard B., Lederer F. Eur. J. Biochem. 100:441-453(1979). 
[6] Levin R.J., Boychuk P.L., Croniger CM., Kazzaz J. A., 
Rozek C.E. Nucleic Acids Res. 17:6349-6367(1989). 

13. KH domain 

20 KH motifs probably bind RNA directly. Auto antibodies to 

Nova, a KH domain protein, cause paraneoplastic opsoclonus 
ataxia . 

[1] Burd CG, Dreyfuss G, Science 19 94;265:615-621. 
[2] Musco G, Stier G, Joseph C, Castiglione Morelli MA, 
25 Nilges M, Gibson TJ, Pastore A, Cell 1996;85:237-245. 

14. MAPEG family (aka: FLAP/ GST2/LTC4S family signature) 

The following mammalian proteins are evolutionary 
related [ 1 ] : 

30 - Leukotriene C4 synthase (EC 2.5.1.37) (gene LTC4S), an 

enzyme that catalyzes the production of LTC4 from 
LTA4 . 
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Microsomal glutathione S-transf erase II (EC 2.5.1.18) 
(GST-II) (gene GST2), an enzyme that can also produces 
LTC4 fron LTA4 . 

5-lipoxygenase activating protein (gene FLAP) , a 
5 protein that seems to be required for the activation 

of 5-lipoxygenase . 
These are proteins of 150 to 160 residues that contain 
three transmembrane segments. As a signature pattern, a 
conserved region between the first and second transmembrane 
10 domains was selected. 

Consensus patternc: G-x (3) -F-E-R-V- [FY] -x-A- [NQ] -x-N-C 

[1] Jakobsson P. -J., Mancini J. A. , Ford-Hutchinson A.W. J. 
15 Biol. Chem. 271:22203-22210(1996) . 

15. Pathogenesis-related protein Bet v I family signature 

A number of plant proteins, which all seem to be 
involved in pathogen defense response, a^e structurally 
20 related [1,2,3]. These proteins are: 

Bet v I, the major pollen allergen from white birch. 

Bet v I is the main cause of type I allergic reactions 

in Europe, North America and USSR. 

Aln g I, the major pollen allergen from alder. 
25 - Api G I, the major allergen from celery. 

Car b I, the major pollen allergen from hornbeam. 

Cor a I, the major pollen allergen from hazel. 

Mai d I, the major pollen allergen from apple. 

Asparagus wound-induced protein AoPRl. 
30 - Kidney bean pathogenesis-related proteins 1 and 2. 

Parsley pathogenesis-related proteins PR1-1 and PRl-3. 

Pea disease resistance response proteins pI49, pI176 

and DRRG4 9-C. 

Pea abscisic acid-responsive proteins ABR17 and ABR18. 
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Potato pathogenesis-related proteins STH-2 and STH-21. 
Soybean stress-induced protein SAM22. 
These proteins are thought to be intracellularly 
located. They contain from 155 to 160 amino acid residues. As 
5 a signature pattern, a conserved region located in the third 
quarter of these proteins has been selected 
Consensus pattern: G-x (2) - [LIVMF] -x (4) -E-x (2) - [CSTAEN] - 
x (8, 9) - [GND] -G- [GS] - [CS] -x (2) -K-x (4) - [FY] - 

10 [1] Breiteneder H., Pettenburger K., Bito A . , Valenta R., 
Kraft D., Rumpold H., Scheiner O., Breitenbach M. EMBO J. 
8 : 1935-1938 (1989) . 

[2] Crowell D., John M.E., Russell D. , Amasino R.M. Plant 
Mol. Biol. 18:459-466(1992) . 
15 [3] Warner S.A.J. , Scott R., Draper J. Plant Mol. Biol. 
19:555-561 (1992) . 

16. Photosystem I psaG / psaK (PSI PSAK) proteins signature 
Photosystem I (PSI) [1] is an integral membrane protein 

20 complex that uses light energy to mediate electron transfer 
from plastocyanin to ferredoxin. It is found in the 
chloroplasts of plants and cyanobacteria . PSI is composed of 
at least 14 different subunits. two of which PSI-G (gene 
psaG) and PSI-K (gene psaK) are small hydrophobic proteins of 

25 about 7 to 9 Kd and evolutionary related [2] . Both seem to 
contain two transmembrane regions. Cyanobacteria seem to 
encode only for PSI-K. 

As a signature pattern, the best-conserved region was 
selected which seems to correspond to the second 

30 transmembrane region. 

-Consensus pattern: [GT] -F-x- [LIVM] -x- [DEA] -x (2) - [GA] -x- 
[GTA] - [SA] -x-G-H-x- [LIVM] - [GA] 
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[1] Golbeck J.H. Biochim. Biophys. Acta 8 95:167-204(1987). 
[2] Kjaerulff S., Andersen B., Nielsen V.S., Moller B.L., 
Okkels J.S. J. Biol. Chem. 2 68:18 912-18916(1993). 

5 17. Plant lipid transfer protein family signature (LTP) 
Plant cells contain proteins, called lipid transfer 
proteins (LTP) [1,2,3], which are able to facilitate the 
transfer of phospholipids and other lipidsacross membranes. 
These proteins, whose subcellular location is not yet known, 

10 could play a major role in membrane biogenesis by conveying 
phospholipids such as waxes or cutin from their site of 
biosynthesis to membranes unable to form these lipids. Plant 
LTP's are proteins of about 9 Kd (90 amino acids) which 
contain eight conserved cysteine residues all involved in 

15 disulfide bridges, as shown in the following schematic 
representation . 

+ + , + + | | | | | 

**************** 

xCxxxxCxxxxxxCCxxxxxxxxCxCxxxxxxxxxxxCxxxxxxCxx I I I I + 

20 | + | + + 

f C ! : conserved cysteine involved in a disulfide bond. 
! *': position of the pattern. 

Consensus pattern: [LIVM] - [ PA] -x (2) -C-x- [LIVM] -x- [LIVM] -x- 
25 [LIVMFY] -x- [LIVM] - [ST] -x ( 3) - [DN] -C-x (2 ) - [LIVM] [The two C's 
are involved in disulfide bonds] 

[1] Wirtz K.W.A. Annu. Rev. Biochem. 60:73-99(1991). 
[2] Arondel V., Kader J.C. Experientia 4 6:579-585(1990). 
30 [3] Ohlrogge J.B., Browse J., Somerville C.R. Biochim. 
Biophys. Acta 1082:1-26(1991). 



18. Ribosomal protein S7e signature 
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A number of eukaryotic ribosomal proteins can be grouped 
on the basis of sequence similarities [1]. One of these 
families consists of: 
Mammalian S7 . 
5 - Xenopus S8 . 

Insect S7. 

Yeast probable ribosomal protein S7 (N2212) . 
Fission yeast probable ribosomal protein S7 
(SpAC18G6. 13c) . 

10 These proteins have about 200 amino acids. A highly conserved 
stretch of 14 residues which is located in the central 
section and which is rich in charged residues was selected as 
a signature pattern. 

15 Consensus pattern: [KR] -L-x-R-E-L-E-K-K-F- [SAP] -x- [KR] -H 

[1] Salazar C.E., Mills-Hamm D.M., Kumar V., Collins F.H. 
Nucleic Acids Res. 21:4147-41*7(1993). 

20 19. Ribosomal protein L34 signature 

Ribosomal protein L34 is one of the proteins from the 
large subunit of the prokaryotic ribosome. It is a small 
basic protein of 44 to 51 amino-acid residues [1] . L34 
belongs to a family of ribosomal proteins which, on the basis 
25 of sequence similarities, groups: - Eubacterial L34 . 
- Red algal chloroplast L34 . - Cyanelle L34. 
A conserved region that corresponds to the N-terminal half of 
L34 has been selected 
as a signature pattern. 
30 -Consensus pattern: K- [RG] -T- [FYWL] - [EQS] -x (5) - [KRHS] -x (4 , 5) - 
G-F-x (2) -R 

[ 1] Old I.G., Margarita D., Saint Girons I. 
Nucleic Acids Res. 20:6097-6097(1992). 
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20. Ribosomal protein L6 signatures 

Ribosomal protein L6 is one of the proteins from the 
large ribosomal subunit. In Escherichia coli, L6 is known to 
bind directly to the 23S rRNA and is located at the 
5 aminoacyl-tRNA binding site of the peptidyltransf erase 

center. It belongs to a family of ribosomal proteins which, 
on the basis of sequence similarities [1,2,3,4], groups: - 
Eubacterial L6 . 

Algal chloroplast L6. 
10 - Cyanelle L6. 

Archaebacterial L6. 

Marchantia polymorpha mitochondrial L6. 
Yeast mitochondrial YmL6 (gene MRPL6) . 
Mammalian L9. 
15 - Drosophila L9. 

Plants L9. 
- Yeast L9 (YL11) . 

While all the above proteins are evolutionary related it 
is very difficult to derive a pattern that will find them 

20 all. Two patterns were therefore created, the first to detect 
eubacterial, cyanelle and mitochondrial L6, the second to 
detect archaebacterial L6 as well as eukaryotic L9. 
-Consensus pattern: [PS] - [DENS] -x-Y-K- [GA] -K-G- [LIVM] 
-Consensus pattern: Q-x (3) - [LIVM] -x (2) - [KR] -x (2) -R-x-F-x- D-G- 

25 [LIVM] -Y- [LIVM] -x (2) - [KR] 

[1] Suzuki K., Olvera J., Wool I.G. Gene 93:2 97-300(1990). 
[2] Schwank S., Harrer R., Schueller H.-J., Schweizer E. 
Curr . Genet. 24:13 6-140(19 93) . 
30 [3] Golden B.L., Ramakrishnan V., White S.W. EMBO J. 12:4901- 
4908 (1993) . 

[ 4] Otaka E., Hashimoto T . , Mizuta K., Suzuki K. Protein 
Seq. Data Anal. 5:301-313(1993). 
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21. Ribosomal protein S14p/S29e (Ribosomal protein S14 
signature) 

Ribosomal protein S14 is one of the proteins from the 
small ribosomal subunit. In Escherichia coli, S14 is known 
5 to be required for the assembly of 30S particles and may also 
be responsible for determining the conformation of 16S rRNA 
at the A site. It belongs to a family of ribosomal proteins 
which, on the basis of sequence similarities [1,2], groups: 

- Eubacterial S14. 

10 - Algal and plant chloroplast S14. 

- Cyanelle S14. 

Archaebacterial Methanococcus vannielii S14. 
Plant mitochondrial S14. 
Yeast mitochondrial MRP2 . 
15 - Mammalian S29. 

- Yeast YS29A/B. 

S14 is a protein of 53 to 115 amino-acid residues. Our 
signature pattern is based on the few conserved positions 
located in the center of these proteins. 

20 

Consensus pattern: [RP] -x (0, 1) -C-x (11, 12) - [LIVMF] -x- [LIVMF] - 
[SC] - [RG] -x (3) - [RN] 

[1] Chan Y.-L., Suzuki K. , Olvera J., Wool I.G. Nucleic Acids 
25 Res. 21:649-655(1993) . 

[2] Otaka E., Hashimoto T., Mizuta K. Protein Seq. Data Anal. 
5:285-300 (1993) . 

22. Ribosomal protein S16 signature 

30 Ribosomal protein S16 is one of the proteins from the 

small ribosomal subunit. It belongs to a family of 
ribosomal proteins which, on the basis of sequence 
similarities [1], groups: 
Eubacterial S16. 
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Algal and plant chloroplast S16. 
- Cyanelle S16. 

Neurospora crassa mitochondrial S24 (cyt-21). 
S16 is a protein of about 100 amino-acid residues. A 
5 conserved region located in the N-terminal extremity of these 
proteins has been selected as a signature pattern. 

Consensus pattern: [LIVMT] -x- [LIVM] - [KR] -L- [STAK] -R-x-G- [AKR] 

10 [1] Otaka E . , Hashimoto T . , Mizuta K. Protein Seq. Data Anal. 
5:285-300 (1993) . 

23. Ribosomal protein S21 signature 

Ribosomal protein S21 is one of the proteins from the 
15 small ribosomal subunit. So far S21 has only been found in 

eubacteria. It is a protein of 55 to 70 amino-acid residues. 
A conserved region in the N-terminal section of the protein 
has been selected as a signature pattern. 

20 Consensus pattern: [DE] -x-A- [LIY] - [KR] -R-F-K- [KR] -x (3) - [KR] 

24. Universal stress protein family (Usp) 

By a wide range of stress conditions members of the Usp 
family are predicted to be related to the MADS-box proteins 
25 transcript fact and bind to DNA [2]. Number of members: 39 

[1] Expression and role of the universal stress protein, 
UspA, of Escherichia coli during growth arrest. Nystrom T, 
Neidhardt FC; Mol Microbiol 1994; 11:537-544. 
30 [2] Sequence analysis of eukaryotic developmental proteins: 
ancient and novel domains. Mushegian AR, Koonin EV; Genetics 
1996; 144 : 817-828 . 
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III . Methods of Modulating Polypeptide Production 

Within the scope of invention are chimeric gene 
constructs wherein the promoter and the structural coding 
sequence and/or other regulatory sequences within said 
constructs are heterologous to each other. "Heterologous 
sequences" are those that are not operatively linked or are 
not contiguous to each other in nature. For example, a promoter 
from corn is considered heterologous to an Arabidopsis coding 
region sequence. Also, a promoter from a gene encoding a 
growth factor from corn is considered heterologous to a 
sequence encoding the corn receptor for the growth factor. 
Regulatory element sequences, such as UTRs or 3' end 
termination sequences that do not originate in nature from the 
same gene as the coding sequence originates from, are 
considered heterologous to said coding sequence. On the other 
hand, elements operatively linked in nature are not 
heterologous. Thus, the promoter and coding portion of a 
corn gene expressing an amino acid transporter are not 
heterologous to each other. 

Such chimeric polynucleotides are of particular interest 
for modulating gene expression in a host cell upon 
transformation of said cell with said chimeric polynucleotide. 

Also within the scope of the invention are DNA molecules, 
whereof at least a part or portion of these DNA molecules are 
presented in SEQ TABLES 1 AND 2 of the present application, and 
wherein the structural coding sequence is under the control of 
its own promoter and/or its own regulatory elements. Such DNA 
molecules are useful for transforming the genome of a host cell 
or an organism regenerated from said host cell. 

Typically, such polynucleotides, whether chimeric or not, 
are "exogenous to" the genome of an individual host cell or 
the organism regenerated fro^m said host cell, such as a plant 
cell, respectively for a plant, when initially or subsequently 
introduced into said host cell or organism, by any means other 
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than by a sexual cross. Examples of means by which this can be 
accomplished are described below, and include Agrobacterium- 
mediated transformation (of dicots - e.g. Salomon et al. EMBO 
J. 3:141 (1984); Herrera-Estrella et al. EMBO J. 2:987 (1983); 
A.C. Vergunst et al, Nucleic Acids Res. 26:11, 2729 (1998); of 
monocots, representative papers are those by Escudero et al., 
Plant J. 10:355 (1996), Ishida et al . , Nature Biotechnology 
14,:745 (1996), May et al., Bio/Technology 13:486 (1995)), 
biolistic methods (Armaleo et al . , Current Genetics 17 : 97 
1990)), electroporation, in planta techniques, and the like. 
Such a plant containing the exogenous nucleic acid is referred 
to here as an Ri generation transgenic plant. Transgenic plants 
which arise from a sexual cross with another parent line or by 
selfing are "descendants or the progeny" of a Ri plant and are 
generally called F n plants or S n plants, respectively, n 
meaning the number of generations. 

The SDFs prepared as described herein can be used to 
prepare expression cassettes useful in a number of techniques 
for suppressing or enhancing expression. 

III. A. Suppression 

Expression cassettes of the invention can be used to 
suppress expression of endogenous genes which comprise the 
SDF sequence. Inhibiting expression can be useful, for 
instance, to tailor the ripening characteristics of a fruit 
(Oeller et al., Science 254 : 437 (1991)) or to influence seed 
size_ (W098 /07842 ) or or to provoke cell ablation (Mariani et 
al., Nature 357: 384-387 (1992). 

As described above, a number of methods can be used to 
inhibit gene expression in plants, such as antisense, 
ribozyme, introduction of "exogenous" genes into a host cell, 
insertion of a polynucleotide sequence into the coding 
sequence and/or the promoter of the endogenous gene of 
interest, and the like. 
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III.A.l. Antisense 

An expression cassette as described above can be 
transformed into host cell or plant to produce an antisense 
strand of RNA. In plant cells, it has been suggested that 
antisense RNA inhibits gene expression by preventing the 
accumulation of mRNA which encodes the enzyme of interest, see, 
e.g., Sheehy et al., Proc. Nat. Acad. Sci . USA, 85:8805 (1988), 
and Hiatt et al., U.S. Patent No. 4,801,340. 

III. A. 2. Ribozymes 

Similarly, ribozyme constructs can be transformed 
into a plant to cleave mRNA and down-regulate translation. 

III. A. 3. Co- Suppress ion 

Another method of suppression is by introducing an 
exogenous copy of the gene to be suppressed. Introduction of 
expression cassettes in which a nucleic acid is configured in 
the sense orientation with respect to the promoter has been 
shown to be an effective means by which to block the 
transcription of target genes. A detailed description of this 
method is described above. 

III. A. 4. Insertion of Sequences into the Gene to 
be Modulated 

Yet another means of suppressing gene expression is 
to insert a polynucleotide into the gene of interest to disrupt 
transcription or translation of the gene. 

Homologous recombination couldeaft be used to target 
a polynucleotide insert to a gene using the Cre-Lox system 
(A.C. Vergunst et al., Nucleic Acids Res. 26:2729 (1998), A.C. 
Vergunst et al., Plant Mol. Biol. 38^:393 (1998), H. Albert et 
al., Plant J". 7:649 (1995)). 
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In addition, random insertion of polynucleotides 
into a host cell genome can also be used to disrupt the gene of 
interest. Azpiroz-Leehan et al., Trends in Genetics 13 : 152 
(1997). In this method, screening for clones from a library 
containing random insertions is preferred for identifying those 
that have polynucleotides inserted into the gene of interest. 
Such screening can be performed using probes and/or primers 
described above based on sequences from SEQ TABLES 1 AND 2, 
fragments thereof, and substantially similar sequence thereto. 
The screening can also be performed by selecting clones or Ri 
plants having a desired phenotype. 

III. A. 5. Promoter Modulation 

Inactivation of the promoter that drives a gene of 
interest can modulate transcription and translation, and 
therefore expression. For example, triple helices can be 
formed using oligonucleotides based on sequences from SEQ 
TABLES 1 AND 2, fragments thereof, and substantially similar 
sequence thereto. The oligonucleotide can be delivered to the 
host cell can bind to the promoter in the genome to form a 
triple helix and prevent transcription. 

Additionally, a vector capable of producing the 
oligonucleotide can be inserted into the host cell to deliver 
the oligonucleotide . 

III. A. 6. Expression of Mutants 

An alternative method for inhibiting gene function 
is through the use of dominant negative mutations. Dominant 
negative mutations produce a mutant polypeptide which is 
capable of competing with the native . polypeptide, but which 
does not produce the native result. Consequently, over 
expression of these mutations can titrate out an undesired 
activity of the native protein. For example, the inactive 
dominant-negative mutant may bind to the same receptor as the 
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native protein, preventing the native protein from activating a 
signal transduction pathway. Alternatively, the dominant- 
negative mutant can be an inactive enzyme still capable of 
binding to the same substrate as the native protein. 

Dominant-negative mutants also can act upon the 
native protein itself to prevent activity. For example, the 
native protein may be active only as a homo-multimer or as one 
subunit of a hetero-mult imer . Incorporation of an inactive 
subunit into the multimer with native subunit (s) can inhibit 
activity. 

Thus, gene function can be modulated by insertion of 
an expression construct encoding a dominant-negative mutant 
into a host cell of interest. 

III.B. Enhanced Expression 

Enhanced expression of a gene of interest in a host cell 
can be accomplished by either (1) insertion of an exogenous 
gene; or (2) promoter modulation. 

III.B.l. Insertion of an Exogenous Gene 

Insertion of an expression construct encoding an 

exogenous gene can boost the number of gene copies expressed in 

a host cell. 

Such expression constructs can comprise genes that 
either encode the native protein that is of interest or that 
encode a variant that exhibits enhanced activity as compared to 
the native protein. Such genes encoding proteins of interest 
can be constructed from the sequences from SEQ TABLES 1 AND 2, 
fragments thereof, and substantially similar sequence thereto. 

Such an exogenous gene can include either a 
constitutive promoter permitting expression in any cell in a 
host organism or a promoter that directs expression only in 
particular cells or times during a host cell life cycle or in 
response to environmental stimuli. 
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III.B.2. Promoter Modulation 

Some promoters require binding of a regulatory 
protein to be activated. Other promoters may need a protein 
that signals a promoter binding protein to expose a polymerase 
binding site. In either case, over-expression of such 

proteins can be used to enhance expression of a gene of 
interest by increasing the activation time of the promoter. 

Such regulatory proteins are encoded by some of the 
sequences in SEQ TABLES 1 AND 2, fragments thereof, and 
substantially similar sequences thereto. 

Coding sequences for these proteins can be constructed as 
described above. 

In some cases, duplication of enhancer elements or 
insertion of exogenous enhancer elements will increase 
expression of a desired gene from a particular promoter. The 
useful enhancer elements can be portions of one or more of the 
SDFs of SEQ TABLES 1 AND 2. 

IV. Gene Constructs and Vector Construction 

To use isolated SDFs of the present invention or a 
combination of them or parts and/or mutants and/or fusions of 
said SDFs in the above techniques, recombinant DNA vectors 
which comprise said SDFs and are suitable for transformation of 
cells, such as plant cells, are usually prepared. 

The vector backbone can be any of those typical in the 
art such as plasmids, viruses, artificial chromosomes, BACs, 
YACs and PACs and vectors of the sort described by **. 

Typically, a vector will comprise the exogenous gene, 
which in its turn comprises an SDF of the present invention to 
be introduced into the genome of a host cell, and which gene 
may be an antisense construct, a ribozyme construct, or a 
structural coding sequence with any desired transcriptional 
and/or translational regulatory sequences, such as promoters 
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and 3' end termination sequences. Vectors of the invention can 
also include origins of replication, markers, homologous 
sequences, introns, etc. 

A DNA sequence coding for the desired polypeptide, for 
example a cDNA sequence encoding a full length protein, will 
preferably be combined with transcriptional and translational 
initiation regulatory sequences which will direct the 
transcription of the sequence from the gene in the intended 
tissues of the transformed plant. 

For example, for over-expression, a plant promoter 
fragment may be employed that will direct expression of the 
gene in all tissues of a regenerated plant. Such promoters are 
referred to herein as "constitutive" promoters and are active 
under most environmental conditions and states of development 
or cell differentiation. Examples of constitutive promoters 
include the cauliflower mosaic virus (CaMV) 35S transcription 
initiation region, the 1' or 2' promoter derived from T-DNA of 
Agrobacterium tumefaciens, and other transcription initiation 
regions from various plant genes known to those of skill. 

Alternatively, the plant promoter may direct expression 
of an SDF of the invention in a specific tissue 
(tissue-specific promoters) or may be otherwise under more 
precise environmental control (inducible promoters) . Examples 
of tissue-specific promoters under developmental control 
include promoters that initiate transcription only in certain 
tissues, such as root, ovule, fruit, seeds, or flowers. The 
promoter from a LEC1 gene, described in copending application 
U.S. Ser. No. 09/103,478, is particularly useful for directing 
gene expression so that a desired gene product is located in 
embryos or seeds. Other suitable promoters include those from 
genes encoding storage proteins or the lipid body membrane 
protein, oleosin. A few root-specific promoters are noted 
above. Examples of environmental conditions that may affect 
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transcription by inducible promoters include anaerobic 
conditions, elevated temperature, or the presence of light. 

If proper polypeptide expression is desired, a 
polyadenylation region at the 3 1 -end of the coding region 
should be included. The polyadenylation region can be derived 
from the natural gene, from a variety of other plant genes, or 
from T-DNA. 

The vector comprising the sequences (e.g., promoters or 
coding regions) from genes of the invention will typically 
comprise a marker gene that confers a selectable phenotype on 
plant cells. For example, the marker may encode biocide 
resistance, particularly antibiotic resistance, such as 
resistance to kanamycin, G418, bleomycin, hygromycin, or 
herbicide resistance, such as resistance to chlorosulf uron or 
phosphinotricin . 

IV. A. Coding Sequences 

Generally, the sequence in the transformation vector 
and to be introduced into the genome of the host cell does not 
need to be absolutely identical to an SDF of the present 
invention. Also, it is not necessary for it to be full length, 
relative to either the primary transcription product or fully 
processed mRNA. Use of sequences shorter than full-length may 
be preferred to avoid concurrent production of some plants that 
are overexpressors. Furthermore, the introduced sequence need 
not have the same intron or exon pattern as a native gene. 
Also, heterologous non-coding segments can be incorporated 
into the coding sequence without changing the desired amino 
acid sequence of the polypeptide to be produced. 

IV. B. Promoters 

As explained above, introducing an exogenous SDF from 
the same species or an orthologous SDF from another species 
can modulate the expression of a native gene corresponding to 
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that SDF of interest. Such an SDF construct can be under the 
control of either a constitutive promoter (e.g., the promoter 
of the 35S gene of the cauliflower mosaic virus or the 
promotor of the gene encoding the cowpea trypsin inhibitor) 
or a highly regulated inducible promoter (e.g., a copper 
inducible promoter) . The promoter of interest can initially 
be either endogenous or heterologous to the species in 
question. When re-introduced into the genome of said species, 
such promoter becomes "exogenous" to said species. The 
promoter-SDF construct can be made using standard recombinant 
DNA techniques (Sambrook et al. 1989) and can be introduced 
to the species of interest by Agrobacterium-mediated 
transformation or by other means of transformation (e.g., 
particle gun bombardment) as referenced above. Over- 
expression of an SDF transgene can lead to co-suppression of 
the homologous gene thereby creating some alterations in the 
phenotypes of the transformed species as demonstrated by 
similar analysis of the chalcone synthase gene (Napoli et 
al., Plant Cell 2:279 (1990) and van der Krol et al., Plant 
Cell 2:291 (1990)). If an SDF is found to encode a protein 
with desirable characteristics, its over-expression can be 
controlled so that its accumulation can be manipulated in an 
organ- or tissue-specific manner utilizing a promoter having 
such specificity. 

Likewise, if the promoter of an SDF (or an SDF that 
includes a promoter) is found to be tissue-specific or 
developmentally regulated, such a promoter can be utilized to 
drive the expression of a specific gene of interest (e.g., 
seed storage protein or root-specific protein) . Thus, the 
level of accumulation of a particular protein can be 
manipulated or its spatial localization in an organ- or 
tissue- specific manner can be altered. 
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IV. C Signal Peptides 

In some cases it may be desirable for the protein 
encoded by an introduced exogenous or orthologous SDF to be 
targeted (1) to a particular organelle, (2) to interact with 
a particular molecule or (3) for secretion outside of the 
cell harboring the introduced SDF. This will be accomplished 
using a signal peptide. 

Signal peptides direct protein targeting, are involved 
in ligand-receptor interactions and act in cell to cell 
communication. Many proteins, especially soluble proteins, 
contain a signal peptide that targets the protein to one of 
several different intracellular compartments. In plants, 
these compartments include, but are not limited to, the 
endoplasmic reticulum (ER) , mitochondria, plastids (such as 
chloroplasts ) , the vacuole, the Golgi apparatus, protein 
storage vessicles (PSV) and, in general, membranes . Some 
signal peptide sequences are conserved, such as the Asn-Pro- 
Ile-Arg amino acid motif found in the N-terminal propeptide 
signal that targets proteins to the vacuole (Marty (1999) The 
Plant Cell 11: 587-599). Other signal peptides do not have a 
consensus sequence per se, but are largely composed of 
hydrophobic amino acids, such as those signal peptides 
targeting proteins to the ER (Vitale and Denecke (1999) The 
Plant Cell 11: 615-628). Still others do not appear to 
contain either a consensus sequence or an identified common 
secondary sequence, for instance the chloroplast stromal 
targeting signal peptides (Keegstra and Cline (1999) The 
Plant Cell 11: 557-570). Furthermore, some targeting 
peptides are bipartite, directing proteins first to an 
organelle and then to a membrane within the organelle (e.g. 
within the thylakoid lumen of the chloroplast; see Keegstra 
and Cline (1999) The Plant Cell 11: 557-570). In addition to 
the diversity in sequence and secondary structure, placement 
of the signal peptide is also varied. Proteins destined for 
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the vacuole, for example, have targeting signal peptides 
found at the N-terminus, at the C-terminus and at a surface 
location in mature, folded proteins. 

Signal peptides also serve as ligands for some 
receptors. Perhaps the best known example of this is the 
interaction of the ER targeting signal peptide with the 
signal recognition particle (SRP). Here, the SRP binds to 
the signal peptide, halting translation, and the resulting 
SRP complex then binds to docking proteins located on the 
surface of the ER, prompting the transfer of the protein into 
the ER. 

These characteristics of signal proteins can be used to 
more tightly control the expression of introduced SDFs. In 
particular, associating the appropriate signal sequence with 
a specific SDF can allow sequestering of the protein in 
specific organelles (plastids, as an example), secretion 
outside of the cell, targeting interaction with particular 
receptors, etc. Hence, the inclusion of signal proteins in 
constructs involving the SDFs of the invention increases the 
range of manipulation of SDF expression. To carry this out, 
constructs are made with the nucleotide sequence of a known 
signal peptide immediately 5' to the initiation of the coding 
region of an SDF so that the signal peptide is translated in 
frame with the coding region and immediately precedes it. 
The nucleotide sequence of the signal peptide can be isolated 
from characterized genes using common molecular biological 
techniques or can be synthesized in vitro. 

V. Transformation Techniques 

A wide range of techniques for inserting exogenous 
polynucleotides are known for a number of host cells, 
including, without limitation, bacterial, yeast, mammalian, 
insect and plant cells. 
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Techniques for transforming a wide variety of higher 
plant species are well known and described in the technical and 
scientific literature. See, e.g. Weising et al., Ann. .Rev. 
Genet. 22:421 (1988); and Christou, Euphytica, v. 85, n. 1-3:13- 
27, (1995). 

DNA constructs of the invention may be introduced into 
the genome of the desired plant host by a variety of 
conventional techniques. For example, the DNA construct may be 
introduced directly into the genomic DNA of the plant cell 
using techniques such as electroporation and microinjection of 
plant cell protoplasts, or the DNA constructs can be introduced 
directly to plant tissue using ballistic methods, such as DNA 
particle bombardment. Alternatively, the DNA constructs may be 
combined with suitable T-DNA flanking regions and introduced 
into a conventional Agrobacterium tumefaciens host vector. The 
virulence functions of the Agrobacterium tumefaciens host will 
direct the insertion of the construct and adjacent marker into 
the plant cell DNA when the cell is infected by the bacteria 
(Vergunst et al . , Nucl . Acids. Res. 26:2729 (1998) (site- 
directed integration using a Cre-Lox recombinase system) ; 
McCormac et al., Mol. Biotechnol. 8:199 (1997); Hamilton, Gene 
200 : 107 (1997)); Salomon et al. EMBO J. 3:141 (1984); Herrera- 
Estrella et al. EMBO J. 2:987 (1983). 

Microinjection techniques are known in the art and well 
described in the scientific and patent literature. The 
introduction of DNA constructs using polyethylene glycol 
precipitation is described in Paszkowski et al. EMBO J. 3:2717 
(1984). Electroporation techniques are described in Fromm et 
al. Proc. Natl Acad. Sci. USA 82:5824 (1985). Ballistic 
transformation techniques are described in Klein et al . Nature 
321 ill 3 (1987). Agrobacterium tumefaciens-mediated 

transformation techniques, including disarming and use of 
binary vectors, are well described in the scientific 
literature. See, for example Hamilton, CM., Gene 200 :107 
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(1997); Muller et al. Mol. Gen. Genet. 207:171 (1987); Komari 
et al. Plant J. 10:165 (1996); Venkateswarlu et al. 
Biotechnology 9:1103 (1991) and Gleave, AP. , Plant Mol. Biol. 
20:1203 (1992); Graves and Goldman, Plant Mol. Biol. 7:34 
5 (1986) and Gould et al., Plant Physiology 95:42 6 (1991). 

Transformed plant cells which are derived by any of the 
above transformation techniques can be cultured to regenerate a 
whole plant that possesses the transformed genotype and thus 
the desired phenotype such as seedlessness . Such regeneration 

10 techniques rely on manipulation of certain phytohormones in a 
tissue culture growth medium, typically relying on a biocide 
and/or herbicide marker which has been introduced together with 
the desired nucleotide sequences. Plant regeneration from 
cultured protoplasts is described in Evans et al., Protoplasts 

15 Isolation and Culture in "Handbook of Plant Cell Culture," pp. 
124-176, MacMillan Publishing Company, New York, 1983; and 
Binding, Regeneration of Plants, Plant Protoplasts , pp. 21-73, 
CRC Press, Boca Raton, 1988. Regeneration can also be obtained 
from plant callus, explants, organs, or parts thereof. Such 

20 regeneration techniques are described generally in Klee et al. 
Ann. Rev. of Plant Phys . 313:467 (1987). Regeneration of 
monocots (rice) is described by Hosoyama et al. {Biosci. 
Biotechnol. Biochem. 58:1500 (1994)) and by Ghosh et al. (J. 
Biotechnol. 32^:1 (1994)). The nucleic acids of the invention 

25 can be used to confer desired traits on essentially any plant. 

Thus, the invention has use over a broad range of plants, 
including species from the genera Asparagus , Atropa, Avena , 
Brassica , Citrus, Citrullus , Capsicum, Cucumis, Cucurbita , 
Daucus, Fragaria , Glycine , Gossypium, Helianthus , Heterocallis , 

30 Hordeum, Hyoscyamus, Lactuca , Linum, Lolium, Lycopersicon , 
Malus, Manihot , Majorana, Medicago , Nicotiana , Oryza, Panieum, 
Pannesetum, Persea, Pisum, Pyrus , Prunus, Raphanus , Secale, 
Senecio , Sinapis , Solanum , Sorghum, Trigonella , Triticum, 
V'tis, Vigna, and, Zea . 
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One of skill will recognize that after the expression 
cassette is stably incorporated in transgenic plants and 
confirmed to be operable, it can be introduced into other 
plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species 
to be crossed. 

The particular sequences of SDFs identified are provided 
in the attached SEQ TABLES 1 AND 2. One of ordinary skill in 
the art, having this data, can obtain cloned DNA fragments, 
synthetic DNA fragments or polypeptides constituting desired 
sequences by recombinant methodology known in the art or 
described herein. 

EXAMPLES 

The invention is illustrated by way of the following 
examples. The invention is not limited by these examples as 
the scope of the invention is defined solely by the claims 
following . 

EXAMPLE 1: SOUTHERN HYBRIDIZATIONS 

The SDFs of the invention can be used in Southern 
hybridizations as described above. The following describes 
extraction of DNA from nuclei of plant cells, digestion of 
the nuclear DNA and separation by length, transfer of the 
separated fragments to membranes, preparation of probes for 
hybridization, hybridization and detection of the hybridized 
probe . 

The procedures described herein can be used to isolate 
related polynucleotides or for diagnostic purposes. Moderate 
stringency hybridization conditions, as defined above, are 
described in the present example. These conditions result in 
detection of hybridization between sequences having at least 
70% sequence identity. As described above, the hybridization 
and wash conditions can be changed to reflect the desired 
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degree of sequence identity between probe and target 
sequences that can be detected. 

In the following procedure, a probe for the 
hybridization is produced from two PCR reactions using two 
primers from genomic sequence of Arabidopsis thaliana. As 
described above, the particular template for generating the 
probe can be any desired template. 

The first PCR product is assessed to validate the size 
of the primer to assure it is of the expected size. Then the 
product of the first PCR is used as a template, with the same 
pair of primers used in the first PCR, in a second PCR that 
produces a labeled product used as the probe. 

Fragments detected by hybridization, or other bands of 
interest, can be isolated from gels used to separate genomic 
DNA fragments by known methods for further purification 
and/ or characterization . 



Buffers for nuclear DNA extraction 

1. 10X HB 





1000 ml 




40 mM 
spermidine 


10.2 g 


Spermine (Sigma S-2876) and 
spermidine (Sigma S-2501) 


10 mM spermine 


3.5 g 


Stabilize chromatin and the 
nuclear membrane 


0.1 M EDTA 
(disodium) 


37.2 g 


EDTA inhibits nuclease 


0.1 M Tris 


12. 1 g 


Buffer 


0.8 M KC1 


59. 6 g 


Adjusts ionic strength for 
stability of nuclei 



Adjust pH to 9.5 with 10 N NaOH. It appears that there 
is a nuclease present in leaves. Use of pH 9.5 appears 
to inactivate this nuclease. 
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2. 2 M sucrose (684 g per 1000 ml) 

Heat about half the final volume of water to about 50°C. 
Add the sucrose slowly then bring the mixture to close 
to final volume; stir constantly until it has dissolved. 
Bring the solution to volume. 

3. Sarkosyl solution (lyses nuclear membranes) 



10 



1000 ml 

N-lauroyl sarcosine (Sarkosyl) 20.0 g 

0.1 M Tris 12. 1 g 

0.04 M EDTA (Disodium) 14 . 9 g 



Adjust the pH to 9.5 after all the components are 
dissolved and bring up to the proper volume. 



15 



4 . 20% Triton X-100 

80 ml Triton X-100 

320 ml lxHB (w/o P~ME and PMSF) 

Prepare in advance; Triton takes some time to dissolve 



A. Procedure 

1. Prepare IX " H" buffer (keep ice-cold during use) 



20 



10X HB 

2 M sucrose 

Water 



1000 ml 
100 ml 

250 ml a non-ionic osmoticum 
634 ml 



Added just before use: 



25 



100 mM PMSF* 



10 ml a protease 
inhibitor; protects 
nuclear membrane proteins 
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Jl-mercaptoethanol 



1 ml inactivates nuclease 
by reducing disulfide 
bonds 



*100 mM PMSF 

(phenyl methyl sulfonyl fluoride, Sigma P-7626) 
(add 0.0875 g to 5 ml 100% ethanol) 



2. Homogenize the tissue in a blender (use 300-400 ml of 
lxHB per blender) . Be sure that you use 5-10 ml of HB 
buffer per gram of tissue. Blenders generate heat so 
be sure to keep the homogenate cold. It is necessary to 
put the blenders- in ice periodically. 

3. Add the 20% Triton X-100 (25 ml per liter of homogenate) 
and gently stir on ice for 20 min. This lyses plastid, 
but not nuclear, membranes. 



4. Filter the tissue suspension through several nylon 
filters into an ice-cold beaker. The first filtration 
is through a 250-micron membrane; the second is through 
an 85-micron membrane; the third is through a 50-micron 
membrane; and the fourth is through a 20-micron 
membrane. Use a large funnel to hold the filters. 
Filtration can be sped up by gently squeezing the liquid 
through the filters. 



5. Centrifuge the filtrate at 1200 x g for 20 min. at 4°C to 
pellet the nuclei. 



6. Discard the dark green supernatant. The pellet will 
have several layers to it. One is starch; it is white 
and gritty. The nuclei are gray and soft. In the early 
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steps, there may be a dark green and somewhat viscous 
layer of chloroplasts . 

VJash the pellets in about 25 ml cold H buffer (with 

Triton X-100) and resuspend by swirling gently and 
5 pipetting. After the pellets are resuspended. 

Pellet the nuclei again at 1200 - 1300 x g. Discard the 
supernatant . 

Repeat the wash 3-4 times until the supernatant has 
changed from a dark green to a pale green. This usually 
10 happens after 3 or 4 resuspensions . At this point, the 

pellet should be grayish white and very slippery. The 
Triton X-100 in these repeated steps helps to destroy 
the chloroplasts and mitochondria that contaminate the 
prep. 



Resuspend the nuclei foi a 
ml of H buffer and transfer 
125 ml Erlenmeyer flask. 



final time in a total of 15 
the suspension to a sterile 



7. Add 15 ml, dropwise, cold 2% Sarkosyl, 0.1 M Tris, 0.04 

M EDTA solution (pH 9.5) while swirling gently. This 

20 lyses the nuclei. The solution will become very 
viscous . 



8. Add 30 grams of CsCl and gently swirl at room 
temperature until the CsCl is in solution. The mixture 
will be gray, white and viscous. 

25 9. Centrifuge the solution at 11, 400 x g at 4°C for at 
least 30 min. The longer this spin is, the firmer the 
protein pellicle. 
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10. The result should be a clear green supernatant over a 
white pellet, and (perhaps) under a protein pellicle. 
Carefully remove the solution under the protein pellicle 
and above the pellet. Determine the density of the 
solution by weighing 1 ml of solution and add CsCl if 
necessary to bring to 1.57 g/ml. The solution contains 
dissolved solids (sucrose etc) and the refractive index 
alone will not be an accurate guide to CsCl 
concentration . 

11. Add 20 jul of 10 mg/ml EtBr per ml of solution. 

12. Centrifuge at 184,000 x g for 16 to 20 hours in a fixed- 
angle rotor. 

13. Remove the dark red supernatant that is at the top of 
the tube with a plastic transfer pipette and discard. 
Carefully remove the DNA band with another transfer 
pipette. The DNA band should be visible in room light; 
otherwise, use a long wave UV light to locate the band. 

14. Extract the ethidium bromide with isopropanol saturated 
with water and salt. Once the solution is clear, 
extract at least two more times to ensure that all of 
the EtBr is gone. Be very gentle, as it is very easy to 
shear the DNA at this step. This extraction may take a 
while because the DNA solution tends to be very viscous. 
If the solution is too viscous, dilute it with TE . 

15. Dialyze the DNA for at least two days against several 
changes (at least three times) of TE (10 mM Tris, ImM 
EDTA, pH 8) to remove the cesium chloride. 
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16. Remove the dialyzed DNA from the tubing. If the 
dialyzed DNA solution contains a lot of debris, 
centrifuge the DNA solution at least at 2500 x g for 10 
min. and carefully transfer the clear supernatant to a 

5 new tube. Read the A260 concentration of the DNA. 

17. Assess the quality of the DNA by agarose gel 
electophesis (1% agarose gel) of the DNA. Load 50 ng 
and 100 ng (based on the OD reading) and compare it with 
known and good quality DNA. Undigested lambda DNA and a 

10 lambda-Hindlll-digested DNA are good molecular weight 

makers . 

Protocol for Digestion of Genomic DNA 

Protocol : 

1. The relative amounts of DNA for different crop plants 
that provide approximately a balanced number of genome 

15 equivalent is given in Table 3. Note that due to the 

size of the wheat genome, wheat DNA will be 
underrepresented . Lambda DNA provides a useful control 
for complete digestion. 

2. Precipitate the DNA by adding 3 volumes of 100% ethanol. 
20 Incubate at -20°C for at least two hours. Yeast DNA can 

be purchased and made up at the necessary concentration, 
therefore no precipitation is necessary for yeast DNA. 

3. Centrifuge the solution at 11,400 x g for 20 min. 
Decant the ethanol carefully (be careful not to disturb 

25 the pellet) . Be sure that the residual ethanol is 

completely removed either by vacuum desiccation or by 
carefully wiping the sides of the tubes with a clean 
tissue . 
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4. Resuspend the pellet in an appropriate volume of water. 
Be sure the pellet is fully resuspended before 
proceeding to the next step. This may take about 30 
min . 

5 5. Add the appropriate volume of 10X reaction buffer 
provided by the manufacturer of the restrictionedr-enzyme 
to the resuspended DNA followed by the appropriate 
volume of enzymes. Be sure to mix it properly by slowly 
swirling the tubes. 

10 6. Set-up the lambda digestion-control for each DNA that 
you are digesting. 

7. Incubate both the experimental and lambda digests 
overnight at 37°C. Spin down condensation in a microfuge 
before proceeding . 

15 8. After digestion, add 2 |nl of loading dye (typically 
0.25% bromophenol blue, 0.25% xylene cyanol in 15% 
Ficoll or 30% glycerol) to the lambda-control digests 
and load in 1% TPE-agarose gel (TPE is 90 mM Tris- 
phosphate, 2 mM EDTA, pH 8) . If the lambda DNA in the 

20 lambda control digests are completely digested, proceed 

with the precipitation of the genomic DNA in the 
digests . 

9. Precipitate the digested DNA by adding 3 volumes of 100% 
ethanol and incubating in -20°C for at least 2 hours 
25 (preferably overnight) . 



EXCEPTION: Arabidopsis and yeast DNA are digested in an 
appropriate volume; they don't have to be 
precipitated . 
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10. Resuspend the DNA in an appropriate volume of TE (e.g., 
22 \il x 50 blots = 1100 \xl) and an appropriate volume of 
10X loading dye (e.g., 2.4 \xl x 50 blots = 120 ^1 ) . Be 
careful in pipetting the loading dye - it is viscous. 
5 Be sure you are pipetting the correct volume. 



Table 3 

Some guide points in digesting genomic DNA. 



Species 


Genome 
Size 


Size 

Relative -to 
Arabidopsis 


Genome 
Equivalent 
to 2 fig 
Arabidopsis 
DNA 


Amount 
of DNA 
per 
blot 


Arabidopsis 


120 Mb 


IX 


IX 


2 |ag 


Brassica 


1, 100 Mb 


9. 2X 


0. 54X 


10 ng 


Corn 


2, 800 Mb 


23 . 3X 


0.43X 


20 ng 


Cotton 


2,300 Mb 


19. 2X 


0. 52X 


20 ^g 


Oat 


11,300 Mb 


94X 


0.11X 


20 ng 


Rice 


400 Mb 


3 . 3X 


0.75X 


5 ng 


Soybean 


1, 100 Mb 


9. 2X 


0. 54X 


10 [iq 


Sugarbeet 


758 Mb 


6. 3X 


0.8X 


10 \xq 


Sweetclover 


1, 100 Mb 


9.2X 


0. 54X 


10 ng 


Wheat 


16,000 Mb 


133X 


0. 08X 


20 \ig 


Yeast 


15 Mb 


0. 12X 


IX 


0.25 \x.q 



10 Protocol for Southern Blot Analysis 

The digested DNA samples are electrophoresed in 1% agarose 
gels in lx TPE buffer. Low voltage; overnight separations 
are preferred. The gels are stained with EtBr and 

photographed . 

15 

1. For blotting the gels, first incubate the gel in 0.25 N 
HC1 (with gentle shaking) for about 15 min. 
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2. Then briefly rinse with water. The DNA is denatured by 
2 incubations. Incubate (with shaking) in 0.5 M NaOH in 
1.5 M NaCl for 15 min. • 

3. The gel is then briefly rinsed in water and neutralized 
by incubating twice (with shaking) in 1.5 M Tris pH 7.5 
in 1.5 M NaCl for 15 min. 

4 . A nylon membrane is prepared by soaking it in water for 
at least 5 min, then in 6X SSC for at least 15 min. 
before use. (20x SSC is 175.3 g NaCl, 88.2 g sodium 
citrate per liter, adjusted to pH 7.0.) 

5. The nylon membrane is placed on top of the gel and all 
bubbles in between are removed. The DNA is blotted from 
the gel to the membrane using an absorbent medium, such 
as paper toweling and 6x SCC buffer. After the 
transfer, the membrane may be lightly brushed with a 
gloved hand to remove any agarose sticking to the 
surface . 

6. The DNA is then fixed to the membrane by UV crosslinking 
and baking at 80°C. The membrane is stored at 4°C until 
use . 

B . Protocol for PCR Amplification of Genomic Fragments in 
Arabidopsis 



Amplification procedures : 

1. Mix the following in a 0.20 ml PCR tube or 96-well 
PCR plate: 







Final Amount or 


Volume 


Stock 


Cone. 
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0.5 ^.1 


~ 10 ng/|j.l genomic DNA 1 


o -tig 


2.5 |il 


10X PCR buffer 


20 mM Tris, 50 mM 

ia'w J- 


0.7 5 jil 


50 mM MgCl 2 


1 . 5 mM 


1 ni 


10 pmol/fil Primer 1 
(to rwdia j 


10 pmol 


1 ill 


10 pmol/|Lil Primer 2 
( Reverse ) 


10 pmol 


0.5 ul 


D TCuyi alM X rr S 


u . i mjyi 


0.1 nl 


5 units/jil Platinum Taq™ 
(Life Technologies, 
Gaithersburg, MD) DNA 
Polymerase 


1 units 


(to 25 


Water 





2. The template DNA is amplified using a Perkin Elmer 
9700 PCR machine: 



1) 94°C for 10 min. followed by 



11 

5 cycles: 


11 

5 cycles: 


11 

25 cycles: 


94 °C - 30 sec 
62 °C - 30 sec 
72 °C - 3 min 


94 °C - 30 sec 
58 °C - 30 sec 
7 2 °C - 3 min 


94 °C - 30 sec 
53 °C - 30 sec 
72 °C - 3 min 



5) 72 C for 7 min. Then the reactions are stopped by 
5 chilling to 4°C. 



1 Arabidopsis DNA is used in the present experiment, but the procedure is a general one. 
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The procedure can be adapted to a multi-well format if 
necessary. 



Quantification and Dilution of PCR Products : 

1. The product of the PCR is analyzed by electrophoresis in 
5 a 1% agarose gel. A linearized plasmid DNA can be used 

as a quantification standard (usually at 50, 100, 200, 
and 400 ng) . These will be used as references to 
approximate the amount of PCR products. Hindlll-digested 
Lambda DNA is useful as a molecular weight marker. The 
10 gel can be run fairly quickly; e.g., at 100 volts. The 

standard gel is examined to determine that the size of 
the PCR products is consistent with the expected size 
and if there are significant extra bands or smeary 
products in the PCR reactions. 

15 2. The amounts of PCR products can be estimated on the 
basis of the plasmid standard. 

3. For the small number of reactions that produce 
extraneous bands, a small amount of DNA from bands with 
the correct size can be isolated by dipping a sterile 
20 10-|xl tip into the band while viewing though a UV 

Transilluminator. The small amount of agarose gel (with 
the DNA fragment) is used in the labeling reaction. 

C. Protocol for PCR-DIG-Labeling of DNA 

Solutions : 

25 Reagents in PCR reactions (diluted PCR products, 10X PCR 

Buffer, 50 mM MgCl 2 , 5 U/|al Platinum Taq Polymerase, and 
the primers) 



10X dNTP + DIG-ll-dUTP [1:5]: (2 mM dATP, 2 mM dCTP, 2 
mM dGTP, 1.65 mM dTTP, 0.35 mM DIG-ll-dUTP) 
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10X dNTP + DIG-ll-dUTP [1:10]: (2 mM dATP, 2 mM dCTP, 2 
mM dGTP, 1.81 mM dTTP, 0.19 mM DIG-ll-dUTP) 

10X dNTP + DIG-ll-dUTP [1:15]: (2 mM dATP, 2 mM dCTP, 2 
mM dGTP, 1.875 mM dTTP, 0.125 mM DIG-ll-dUTP) 

TE buffer (10 mM Tris, 1 mM EDTA, pH 8) 

Maleate buffer: In 700 ml of deionized distilled water, 
dissolve 11.61 g maleic acid and 8.77 g NaCl . Add NaOH 
to adjust the pH to 7.5. Bring the volume to 1 L. Stir 
for 15 min. and sterilize. 

10% blocking solution: In 80 ml deionized distilled 
water, dissolve 1.16g maleic acid. Next, add NaOH to 
adjust the pH to 7.5. Add 10 g of the blocking reagent 
powder (Boehringer Mannheim, Indianapolis, IN, Cat. no. 
1096176). Heat to 60°C while stirring to dissolve the 
powder. Adjust the volume to 100 ml with water. Stir 
and sterilize. 

1% blocking solution: Dilute the 10% stock to 1% using 
the maleate buffer. 

Buffer 3 (100 mM Tris, 100 mM NaCl, 50 mM MgCl 2 , pH9.5). 
Prepared from autoclaved solutions of 1M Tris pH 9.5, 5 
M NaCl, and 1 M MgCl 2 in autoclaved distilled water. 
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Procedure : 



PCR reactions are performed in 25 ^il volumes containing: 



PCR buffer 
MgCl 2 

10X dNTP + DIG-ll-dUTP 

Platinum Taq™ Polymerase 
10 pg probe DNA 
10 pmol primer 1 



IX 

1.5 mM 

IX (please see the 

note below) 
1 unit 



10 



Note : 



IPX dNTP 4- DIG-ll-dUTP (1:5) 



10X dNTP + DIG-ll-dUTP (1:10) 
10X dNTP + DIG-ll-dUTP (1:15) 



Use for: 
< 1 kb 



1 kb to 1.8 kb 
> 1 . 8 kb 



15 



The PCR reaction uses the following amplification 
cycles : 



1) 94°C for 10 min. 



21 

5 cycles: 


2L 

5 cycles : 


±L 

25 cycles: 


95°C - 30 sec 
61°C - 1 min 
7 3°C - 5 min 


95°C - 30 sec 
5 9°C - 1 min 
75°C - 5 min 


95°C - 30 sec 
51°C - 1 min 
73°C - 5 min 



5) 72°C for 8 min. The reactions are terminated by 
chilling to 4°C (hold) . 



3. 

20 



The products are analyzed by electrophoresis- in a 1% 
agarose gel, comparing to an aliquot of the unlabelled 
probe starting material. 
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4. The amount of DIG-labeled probe is determined as 
follows : 



Make serial dilutions of the diluted control DNA in 
dilution buffer (TE: 10 mM Tris and 1 mM EDTA, pH 8) as 
shown in the following table: 



DIG-labeled 
control DNA 
starting cone . 


Stepwise Dilution 


Final Cone . 
(Dilution Name) 


5 ng/(o.l 


1 (Jl in 49 Hi TE 


100 pg/nl (A) 


100 pg/fal (A) 


25 jil in 25 jal TE 


50 pg/\il (B) 


50 pg/fj.1 (B) 


25 \il in 25 ]xl TE 


25 pg/^1 (C) 


25 pg/nl (C) 


20 \il in 30 \il TE 


10 pg/nl (D) 



a. Serial deletions of a DIG-labeled standard DNA 
ranging from 100 pg to 10 pg are spotted onto a 
positively charged nylon membrane, marking the 
membrane lightly with a pencil to identify each 
dilution . 



b. Serial dilutions (e.g., 1:50, 1:2500, 1:10,000) of 
the newly labeled DNA probe are spotted. 

c. The membrane is fixed by UV crosslinking . 

d. The membrane is wetted with a small amount of 
maleate buffer and then incubated in 1% blocking 
solution for 15 min at room temp. 

e. The labeled DNA is then detected using alkaline 
phosphatase conjugated anti-DIG antibody 

(Boehringer Mannheim, Indianapolis, IN, cat. no. 
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1093274) and an NBT substrate according to the 
manufacture 1 s instruction. 

f. Spot intensities of the control and experimental 
dilutions are then compared to estimate the 
concentration of the PCR-DIG-labeled probe. 

D. Prehybridization and Hybridization of Southern Blots 

Solutions : 



100% Formamide 



20X SSC 



per L : 



purchased from Gibco 

(IX = 0.15 M NaCI, 0.015 
Na 3 citrate) 
175 g NaCI 

87.5 g Na 3 citrate-2H 2 0 



M 



20% Sarkosyl (N-lauroyl-sarcosine) 
20% SDS (sodium dodecyl sulphate) 

10% Blocking Reagent: In 80 ml deionized distilled 
water, dissolve 1.16 g maleic acid. Next, add NaOH 
to adjust the pH to 7.5. Add 10 g of the blocking 
reagent powder. Heat to 60°C while stirring to 
dissolve the powder. Adjust the volume to 100 ml 
with water. Stir and sterilize. 



Prehybridization Mix: 



Final 

Concentration 


Components 


Volume 
(per 100 ml) 


Stock 


50% 


Formamide 


50 ml 


100% 


5X 


SSC 


2 5 ml 


20X 


0 . 1% 


Sarkosyl 


0 . 5 ml 


20% 



WO 00/40695 PCT/US00/00466 



104 



0.02% 


SDS 


0.1 ml 


20% 


2% 


Blocking Reagent 


2 0 ml 


10% 




Water 


4.4 ml 





General Procedures : 

1. Place the blot in a heat-sealable plastic bag and add an 
appropriate volume of prehybridizat ion solution (30 
ml/100cm 2 ) at room temperature. Seal the bag with a heat 
sealer, avoiding bubbles as much as possible. Lay down 
the bags in a large plastic tray (one tray can 
accommodate at least 4-5 bags) . Ensure that the bags 
are lying flat in the tray so that the prehybridizat ion 
solution is evenly distributed throughout the bag. 
Incubate the blot for at least 2 hours with gentle 
agitation using a waver shaker. 

2. Denature DIG-labeled DNA probe by incubating for 10 min. 
at 98°C using the PCR machine and immediately cool it to 
4°C. 

3. Add probe to prehybridi zat ion solution (25 ng/ml; 30 ml 
= 750 ng total probe) and mix well but avoid foaming. 
Bubbles may lead to background. 

4 . Pour off the prehybridizat ion solution from the 
hybridization bags and add new prehybridizat ion and 
probe solution mixture to the bags containing the 
membrane . 

5. Incubate with gentle agitation for at least 16 hours. 

6. Proceed to medium stringency post-hybridization wash: 
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Three times for 20 min. each with gentle agitation using 
IX SSC, 1% SDS at 60°C. 

All wash solutions must be prewarmed to 60°C. Use about 
100 ml of wash solution per membrane. 

5 To avoid background keep the membranes fully submerged 

to avoid drying in spots; agitate sufficiently to avoid 
having membranes stick to one another. 

7. After the wash, proceed to immunological detection and 
CSPD development. 

10 E . Procedure for Immunological Detection with CSPD 

Solutions : 



Buffer 1: 



Maleic acid buffer (0.1 M maleic 
acid, 0.15 M NaCl; adjusted to pH 
7 . 5 with NaoH) 



15 Washing buffer: 



Maleic acid buffer with 0.3% (v/v) 
Tween 20. 



Blocking stock solution 10% blocking reagent in buffer 1. 

Dissolve (10X concentration) : 

blocking reagent powder (Boehringer 
20 Mannheim, Indianapolis, IN, cat. no. 

1096176) by constantly stirring on a 
65°C heating block or heat in a 
microwave, autoclave and store at 
4°C. 



25 Buffer 2 
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Dilute the stock solution 1:10 in 



Buffer 1. 



Detection buffer: 



0.1 M Tris, 0.1 M NaCI, pH 9.5 



Procedure : 

1. After the post-hybridization wash the blots are briefly 
rinsed (1-5 min.) in the maleate washing buffer with 
gentle shaking. 

2. Then the membranes are incubated for 30 min. in Buffer 2 
with gentle shaking. 

3. Ant i-DIG-AP conjugate (Boehringer Mannheim, 
Indianapolis, IN, cat. no. 1093274) at 75 mU/ml 
(1:10,000) in Buffer 2 is used for detection. 75 ml of 
solution can be used for 3 blots. 

4. The membrane is incubated for 30 min. in the antibody 
solution with gentle shaking. 

5. The membrane are washed twice in washing buffer with 
gentle shaking. About 250 mis is used per wash for 3 
blots . 

6. The blots are equilibrated for 2-5 min in 60 ml 
detection buffer. 

7. Dilute CSPD (1:200) in detection buffer. (This can be 
prepared ahead of time and stored in the dark at 4°C) . 

The following steps must be done individually. Bags (one 
for detection and one for exposure) should be cut and 
ready before doing the following steps. 
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8. The blot is carefully removed from the detection buffer 
and excess liquid removed without drying the membrane. 
The blot is immediately placed in a bag and 1.5 ml of 
CSPD solution is added. The CSPD solution can be spread 
over the membrane. Bubbles present at the edge and on 
the surface of the blot should be removed by gentle 
rubbing. The membrane is incubated for 5 min. in CSPD 
solution . 

9. Excess liquid is removed and the membrane is blotted 
briefly (DNA side up) on Whatman 3 MM paper. Do not let 
the membrane dry completely. 

10. Seal the damp membrane in a hybridization bag and 
incubate for 10 min at 37°C to enhance the luminescent 
reaction . 

11. Expose for 2 hours at room temperature to X-ray film. 
Multiple exposures can be taken. Luminescence continues 
for at least 24 hours and signal intensity increases 
during the first hours. 

Example 2 : Transformation of Carrot Cells 

Transformation of plant cells can be accomplished by a 
number of methods, as described above. Similarly, a number 
of plant genera can be regenerated from tissue culture 
following transformation. Transformation and regeneration of 
carrot cells as described herein is illustrative. 

Single cell suspension cultures of carrot {Daucus 
carota) cells are established from hypocotyls of cultivar 
Early Nantes in B 5 growth medium (O.L. Gamborg et al . , Plant 
Physiol. 45:372 (1970)) plus 2,4-D and 15 mM CaCl 2 (B 5 -44 
medium) by methods known in the art. The suspension cultures 
are subcultured by adding 10 ml of the suspension culture to 



WO 00/40695 PCT/USOO/00466 

108 

40 ml of B 5 -44 medium in 250 ml flasks every 7 days and are 
maintained in a shaker at 150 rpm at 27 °C in the dark. 

The suspension culture cells are transformed with 
exogenous DNA as described by Z. Chen et al . Plant Mol . Bio. 
3_6:163 (1998). Briefly, 4-days post-subculture cells are 
incubated with cell wall digestion solution containing 0.4 M 
sorbitol, 2% driselase, 5mM MES (2- [N-Morpholino] 
ethanesulf onic acid) pH 5.0 for 5 hours. The digested cells 
are pelleted gently at 60 xg for 5 min. and washed twice in 
W5 solution containing 154 mM NaCI, 5 mM KC1, 125 mM CaCl 2 and 
5mM glucose, pH 6.0. The protoplasts are suspended in MC 
solution containing 5 mM MES, 20 mM CaCl 2/ 0.5 M mannitol, pH 
5.7 and the protoplast density is adjusted to about 4 x 10 6 
protoplasts per ml. 

15-60 jig of plasmid DNA is mixed with 0.9 ml of 
protoplasts. The resulting suspension is mixed with 40% 
polyethylene glycol (MW 8000, PEG 8000) , by gentle inversion 
a few times at room temperature for 5 to 25 min. Protoplast 
culture medium known in the art is added into the PEG-DNA- 
protoplast mixture. Protoplasts are incubated in the culture 
medium for 24 hour to 5 days and cell extracts can be used 
for assay of transient expression of the introduced gene. 
Alternatively, transformed cells can be used to produce 
transgenic callus, which in turn can be used to produce 
transgenic plants, by methods known in the art. See, for 
example, Nomura and Komamine, Pit. Phys. 7J9: 988-991 (1985), 
Identification and Isolation of Single Cells that Produce 
Somatic Embryos in Carrot Suspension Cultures . 

The invention being thus described, it will be apparent 
to one of ordinary skill in the art that various 
modifications of the materials and methods for practicing the 
invention can be made. Such modifications are to be 
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considered within the scope of the invention as defined by 
the following claims. 

Each of the references from the patent and periodical 
literature cited herein is hereby expressly incorporated in its 
5 entirety by such citation. 
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REF TABLE 1 

Maximum Length Sequence: 

related to: 
Clone IDs: 
9581 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 1 

- Ceres seq_id 1007546 

- Alternative transcription start site(s) located in SEQ 
ID NO 1: 

-96,-51,25,27,29,55, 64 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 2 

- Ceres seq_id 1007547 

- Location of start within SEQ ID NO 1: at 1 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 2: at 32 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 1 

- gi No. 3879939 

- Description: 

- % Identity: 45.9 

- Alignment Length: 61 

- Location of Alignment in SEQ ID NO 2: from 48 to 

108 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 3 

- Ceres seq_id 1007548 

- Location of start within SEQ ID NO 1: at 100 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 2 

- gi No. 3879939 

- Description: 

- % Identity: 45.9 

- Alignment Length: 61 

- Location of Alignment in SEQ ID NO 3: from 15 to 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 

- Ceres seq__id 1007549 

- Location of start within SEQ ID NO 1: at 121 nt . 



10 



45 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 3 

- gi No. 3879939 

- Description: 

- % Identity: 45.9 

- Alignment Length: 61 

- Location of Alignment in SEQ ID NO 4 : from 8 to 68 

(Ba) Polypeptide Activities: Similar to yeast membrane 
protein activities 



15 Maximum Length Sequence: 
related to: 
Clone IDs: 
9568 
402131 

20 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 5 

- Ceres seq_id 1007583 

- Alternative transcription start site(s) located in SEQ 
ID NO 5: 

25 2, 3, 4,7,10, 11, 12 ,17, 42,43, 300, 505 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 6 

- Ceres seq_id 1007584 

30 - Location of start within SEQ ID NO 5: at 55 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- KH domain 

35 - Location within SEQ ID NO 6: from 47 to 95 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 4 

- gi No. 133940 
40 - Description: 

- % Identity: 75.1 

- Alignment Length: 250 

- Location of Alignment in SEQ ID NO 6: from 1 to 



249 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 7 

- Ceres seq_id 1007585 

- Location of start within SEQ ID NO 5: at 184 nt . 



50 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- KH domain 

- Location within SEQ ID NO 7: from 4 to 52 aa . 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 5 

- gi No. 133940 

- Description : 

10 - % Identity: 75.1 

- Alignment Length: 250 

- Location of Alignment in SEQ ID NO 7: from 1 to 

206 



15 Maximum Length Sequence: 
related to: 
Clone IDs: 
91769 

(Ac) cDNA Polynucleotide Sequence 
20 - Pat. Appln. SEQ ID NO 8 

- Ceres seq_id 1008148 

- Alternative transcription start site(s) located in SEQ 
ID NO 8: 

-19, 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 14 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 9 

- Ceres seq_id 1008149 

- Location of start within SEQ ID NO 8: at 3 nt . 

30 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 6 
35 - gi No. 4539292 

- Description: 

- % Identity: 89.9 

- Alignment Length: 181 

- Location of Alignment in SEQ ID NO 9: from 25 to 

40 203 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 10 

- Ceres seq_id 1008150 

45 - Location of start within SEQ ID NO 8: at 75 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
50 - Alignment No. 7 

- gi No. 4539292 
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- Description: 

- % Identity: 89.9 

- Alignment Length: 181 

- Location of Alignment in SEQ ID NO 10: from 1 to 



179 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 11 

- Ceres seq_id 1008151 

10 - Location of start within SEQ ID NO 8: at 210 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
15 - Alignment No. 8 

- gi No. 4539292 

- Description: 

- % Identity: 89.9 

- Alignment Length: 181 

20 - Location of Alignment in SEQ ID NO 11: from 1 to 

134 

(Ba) Polypeptide Activities: Similar to 40S Ribosomal 
protein activities, and glycine rich RNA binding protein 
25 activities. 



Maximum Length Sequence: 
related to: 
30 Clone IDs: 
8898 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 12 

- Ceres seq__id 1008334 

35 - Alternative transcription start site(s) located in SEQ 

ID NO 12: 

-12, 29, 30 

(B) Polypeptide Sequence 
40 - Pat. Appln. SEQ ID NO 13 

- Ceres seq_id 1008335 

- Location of start within SEQ ID NO 12: at 2 nt . 

(C) Nomination and Annotation of Domains within 
45 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 9 

- gi No. 3602948 

- Description: 

50 - % Identity: 43.7 

- Alignment Length: 71 
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- Location of Alignment in SEQ ID NO 13: from 178 

to 248 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 14 

- Ceres seq_id 1008336 

- Location of start within SEQ ID NO 12: at 59 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 14 : at 22 aa. 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 10 
15 - gi No. 3602948 

- Description: 

- % Identity: 43.7 

- Alignment Length: 71 

- Location of Alignment in SEQ ID NO 14: from 159 

20 to 229 



(Ba) Polypeptide Activities: Plant specific gene, Chloroplast 
specific gene. 

2 5 Maximum Length Sequence: 
related to: 
Clone IDs: 
8286 

(Ac) cDNA Polynucleotide Sequence 
30 - Pat. Appln. SEQ ID NO 15 

- Ceres seq_id 1008701 

- Alternative transcription start site(s) located in SEQ 
ID NO 15: 

-6,2,3, 5,6,7, 18,24, 25, 28, 31,33,35,37,42, 50 

35 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 16 

- Ceres seq_id 1008702 

- Location of start within SEQ ID NO 15: at 1 nt . 

40 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 11 
45 - gi No. 1083282 

- Description: 

- % Identity: 41.4 

- Alignment Length: 106 

- Location of Alignment in SEQ ID NO 16: from 23 to 

5 0 121 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 17 

- Ceres seq__id 1008703 

- Location of start within SEQ ID NO 15: at 67 nt . 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 12 
10 - gi No. 1083282 

- Description: 

- % Identity: 41.4 

- Alignment Length: 106 

- Location of Alignment in SEQ ID NO 17: from 1 to 

15 99 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 18 

- Ceres seq_id 1008704 

20 - Location of start within SEQ ID NO 15: at 2 nt . 

(Ba) Polypeptide Activities: Similar to cytochrome C oxidase 
activities . 

25 

Maximum Length Sequence: 

related to: 
Clone IDs: 
7792 

30 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 19 

- Ceres seq_id 1009003 

- Alternative transcription start site(s) located in SEQ 
ID NO 19: 

35 2,374 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 20 

- Ceres seq_id 1009004 

40 - Location of start within SEQ ID NO 19: at 48 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
45 - Alignment No. 13 

- gi No. 3582320 

- Description: 

- % Identity: 32.6 

- Alignment Length: 44 

50 - Location of Alignment in SEQ ID NO 20: from 51 to 

93 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 21 

- Ceres seq__id 1009005 

5 - Location of start within SEQ ID NO 19: at 57 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
10 - Alignment No. 14 

- gi No. 3582320 

- Description : 

- % Identity: 32.6 

- Alignment Length: 4 4 

15 - Location of Alignment in SEQ ID NO 21: from 48 to 

90 

(Ba) Polypeptide Activities: Similar to Ring-H2 Zinc Finger 
Protein activities. 

20 

Maximum Length Sequence: 

related to: 
Clone IDs: 
25 7337 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 22 

- Ceres seq_id 1009345 

- Alternative transcription start site(s) located in SEQ 
30 ID NO 22 : 

2 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 23 
35 - Ceres seq_id 1009346 

- Location of start within SEQ ID NO 22: at 50 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 23: at 22 aa . 

40 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 15 

- gi No. 3176705 
45 - Description: 

- % Identity: 53.3 

- Alignment Length: 75 

- Location of Alignment in SEQ ID NO 23: from 1 to 

75 

50 

(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 24 

- Ceres seq__id 1009347 

- Location of start within SEQ ID NO 22: at 62 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
10 - Alignment No. 16 

- gi No. 3176705 

- Description: 

- % Identity: 53.3 

- Alignment Length: 75 

15 - Location of Alignment in SEQ ID NO 24: from 1 to 

71 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

20 specific gene, plant 

specific gene. 

Maximum Length Sequence: 
related to: 
25 Clone IDs: 
6349 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 2 5 

- Ceres seq_id 1010140 

30 - Alternative transcription start site(s) located in SEQ 

ID NO 25: 

-31, -2 9, 4, 5, 6, 10, 17, 34, 41, 749 



(B) Polypeptide Sequence 
35 - Pat. Appln. SEQ ID NO 26 

- Ceres seq_id 1010141 

- Location of start within SEQ ID NO 25: at 3 nt . 

(C) Nomination and Annotation of Domains within 
40 Predicted Polypeptide ( s ) 

- Bacterial regulatory proteins, deoR family 

- Location within SEQ ID NO 26: from 57 to 95 aa. 



(Dp) Related Amino Acid Sequences 
45 - Alignment No. 17 

- gi No. 3257798 

- Description: 

- % Identity: 40.6 

- Alignment Length: 234 

50 - Location of Alignment in SEQ ID NO 26: from 56 to 

289 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 27 

- Ceres seq__id 1010142 

5 - Location of start within SEQ ID NO 25: at 42 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Bacterial regulatory proteins, deoR family 
10 - Location within SEQ ID NO 27: from 44 to 82 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 18 

- gi No. 3257798 
15 - Description: 

- % Identity: 40.6 

- Alignment Length: 2 34 

- Location of Alignment in SEQ ID NO 27: from 43 to 

276 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 28 

- Ceres seq_id 1010143 

- Location of start within SEQ ID NO 25: at 231 nt . 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 19 
30 - gi No. 3257798 

- Description: 

- % Identity: 40.6 

- Alignment Length: 234 

- Location of Alignment in SEQ ID NO 28: from 1 to 

35 213 



Maximum Length Sequence: 

related to: 
Clone IDs: 
40 6261 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 29 

- Ceres seq_id 1010217 

- Alternative transcription start site(s) located in SEQ 
45 ID NO 29 : 

2,5,15 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 30 
50 - Ceres seq_id 1010218 

- Location of start within SEQ ID NO 29: at 85 nt . 
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- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 30: at 22 aa. 

(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 20 

- gi No. 3341723 

- Description: 

10 - % Identity: 64.3 

- Alignment Length: 131 

- Location of Alignment in SEQ ID NO 30: from 1 to 

118 

15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 31 

- Ceres seq_id 1010219 

- Location of start within SEQ ID NO 29: at 118 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
20 ID NO 31: at 13 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 21 

- gi No. 3341723 

- Description: 

- % Identity: 64.3 

- Alignment Length: 131 

30 - Location of Alignment in SEQ ID NO 31: from 1 to 

107 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 32 
35 - Ceres seq_id 1010220 

- Location of start within SEQ ID NO 29: at 121 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 (Dp) Related Amino Acid Sequences 

- Alignment No. 22 

- gi No. 3341723 

- Description: 

- % Identity: 64.3 

45 - Alignment Length: 131 

- Location of Alignment in SEQ ID NO 32: from 1 to 

106 



(Ba) Polypeptide Activities: Similar to Constans like 
50 Protein activities and Zinc Finger Protein Activities. 
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Maximum Length Sequence: 

related to: 
Clone IDs: 
5 6145 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 33 

- Ceres seq_id 1010302 

- Alternative transcription start site(s) located in SEQ 
10 ID NO 33: 

-5 ,-3, -2 ,-1,2,3,4, 5,6,7, 8, 10,11, 12 ,13, 15,19, 23,45, 349 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 34 
15 - Ceres seq_id 1010303 

- Location of start within SEQ ID NO 33: at 59 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
20 - Pathogenesis-related protein Bet v I family 

- Location within SEQ ID NO 34: from 5 to 155 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 23 
25 - gi No. 1321731 

- Description: 

- % Identity: 35.7 

- Alignment Length: 159 

- Location of Alignment in SEQ ID NO 34: from 5 to 

30 155 



Maximum Length Sequence: 

related to: 
Clone IDs: 
35 5180 

Public Genomic DNA: 
gi No: 4757410 
Predicted Exons: 

INTR 37202 . . . 37397 OCKHAMG-CDNA 
40 INTR 37493 ... 37825 OCKHAMG-CDNA 



INIT 37271 ... 37397 OCKH AMG -CDS 
TERM 37493 ... 37704 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 
45 - Pat. Appln. SEQ ID NO 35 

- Ceres seq__id 1010815 

- Alternative transcription start site(s) located in SEQ 
ID NO 35: 

15, 16, 17, 18, 19,29, 31, 34 

50 

(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 36 

- Ceres seq__id 1010816 

- Location of start within SEQ ID NO 35: at 70 nt . 



5 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 24 

- gi No. 2879811 
10 - Description: 

- % Identity: 88.4 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 36: from 1 to 

112 

15 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 37 

- Ceres seq_id 1010817 

- Location of start within SEQ ID NO 35: at 133 nt . 

20 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 25 
25 - gi No. 2879811 

- Description: 

- % Identity: 88.4 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 37: from 1 to 

30 91 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 38 

- Ceres seq_id 1010818 

35 - Location of start within SEQ ID NO 35: at 257 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 38 : at 26 aa. 



(Ba) Polypeptide Activities: Similar to ribosomal protein 
40 actitivies. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
45 42842 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 39 

- Ceres seq_id 1011437 

- Alternative transcription start site(s) located in SEQ 
5 0 ID NO 39: 

-28 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 40 

- Ceres seq_id 1011438 

5 - Location of start within SEQ ID NO 39: at 2 nt. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 41 

- Ceres seq_id 1011439 

10 - Location of start within SEQ ID NO 39: at 1 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 41: at 29 aa. 

(C) Nomination and Annotation of Domains within 
15 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 2 6 

- gi No. 3334271 

- Description: 

20 - % Identity: 29.6 

- Alignment Length: 54 

- Location of Alignment in SEQ ID NO 41: from 13 to 

65 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 42 

- Ceres seq_id 1011440 

- Location of start within SEQ ID NO 39: at 28 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
30 ID NO 42: at 20 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 27 

- gi No. 3334271 

- Description: 

- % Identity: 29.6 

- Alignment Length: 54 

40 - Location of Alignment in SEQ ID NO 42: from 4 to 

56 

(Ba) Polypeptide Activities: Similar to NADH Oxidoxductase 
MWFE Subunit Protein Activities. 

45 

Maximum Length Sequence: 

related to: 
Clone IDs: 
50 42475 

(Ac) cDNA Polynucleotide Sequence 
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- Pat. Appln. SEQ ID NO 43 

- Ceres seq__id 1011616 

- Alternative transcription start site(s) located in SEQ 
ID NO 43: 

5 3,5,476 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 44 

- Ceres seq_id 1011617 

10 - Location of start within SEQ ID NO 43: at 115 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
15 - Alignment No. 28 

- gi No. 3859560 

- Description: 

- % Identity: 36.2 

- Alignment Length: 225 

20 - Location of Alignment in SEQ ID NO 44: from 31 to 

248 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 45 
25 - Ceres seq_id 1011618 

- Location of start within SEQ ID NO 43: at 406 nt . 

(C) Nomination and Annotation of Doma^ "^s within 
Predicted Polypeptide (s) 
30 (Dp) Related Amino Acid Sequences 

- Alignment No. 29 

- gi No. 3859560 

- Description: 

- % Identity: 36.2 

35 - Alignment Length: 225 

- Location of Alignment in SEQ ID NO 45: from 1 to 

151 

(Ba) Polypeptide Activities: Similar to acycl-protein 
40 thioosterases protein activities, calcium independent 

phospholipase A2 activities, and carboxy les terase activities. 



4 5 Maximum Length Sequence: 
related to: 
Clone IDs: 
42405 

(Ac) cDNA Polynucleotide Sequence 
50 - Pat. Appln. SEQ ID NO 46 

- Ceres seq_id 1011631 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 47 
5 - Ceres seq_id 1011632 

- Location of start within SEQ ID NO 46: at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
10 (Dp) Related Amino Acid Sequences 

- Alignment No. 30 

- gi No. 3618318 

- Description: 

- % Identity: 72.7 

15 - Alignment Length: 44 

- Location of Alignment in SEQ ID NO 47: from 91 to 

134 



(B) Polypeptide Sequence 
2 0 - Pat. Appln. SEQ ID NO 4 8 

- Ceres seq_id 1011633 

- Location of start within SEQ ID NO 46: at 9 nt . 



(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 31 

- gi No. 3618318 

- Description: 

30 - % Identity: 72.7 

- Alignment Length: 44 

- Location of Alignment in SEQ ID NO 48: from 89 to 

132 



35 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 9 

- Ceres seq_id 1011634 

- Location of start within SEQ ID NO 46: at 15 nt . 



40 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 32 

- gi No. 3618318 
45 - Description: 

- % Identity: 72.7 

- Alignment Length: 4 4 

- Location of Alignment in SEQ ID NO 49: from 87 to 

130 

50 
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(Ba) Polypeptide Activities: Similar to Constans protein 
activities, and zinc finger protein activities. 

Maximum Length Sequence: 
5 related to: 

Clone IDs: 
42240 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 50 
10 - Ceres seq_id 1011714 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 51 
15 - Ceres seq_id 1011715 

- Location of start within SEQ ID NO 50: at 2 nt . 



(C) Nomination and Ann 
Predicted Polypeptide (s) 
20 - Bacterial mutT 

- Location within 



otation of Domains within 
protein 

SEQ ID NO 51: from 26 to 67 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 33 
25 - gi No. 2129134 

- Description: 

- % Identity: 40.8 

- Alignment Length: 121 

- Location of Alignment in SEQ ID NO 51: from 12 to 

30 131 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 52 

- Ceres seq_id .1011716 

35 - Location of start within SEQ ID NO 50: at 14 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Bacterial mutT protein 
40 - Location within SEQ ID NO 52: from 22 to 63 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 34 

- gi No. 2129134 
45 - Description: 

- % Identity: 40.8 

- Alignment Length: 121 

- Location of Alignment in SEQ ID NO 52: from 8 to 

127 

50 

(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 53 

- Ceres seq_id 1011717 

- Location of start within SEQ ID NO 50: at 185 nt . 

5 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 35 

- gi No. 2129134 
10 - Descriptioxi : 

- % Identity: 40.8 

- Alignment Length: 121 

- Location of Alignment in SEQ ID NO 53: from 1 to 

70 

15 

Maximum Length Sequence: 

related to: 
Clone IDs: 
42169 

20 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 54 

- Ceres seq_id 1011784 

- Alternative transcription start site(s) located in SEQ 
ID NO 54 : 

25 -15,-4,7, 402 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 55 

- Ceres seq_id 1011785 

30 - Location of start within SEQ ID NO 54: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 55: at 42 aa . 



(C) Nomination and Annotation of Domains within 
35 Predicted Polypeptide ( s ) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 55: from 45 to 108 aa . 



(Dp) Related Amino Acid Sequences 
40 - Alignment No. 36 

- gi No. 543565 

- Description : 

- % Identity: 56.5 

- Alignment Length: 8 5 

45 - Location of Alignment in SEQ ID NO 55: from 29 to 

110 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 56 
50 - Ceres seq_id 1011786 

- Location of start within SEQ ID NO 54: at 49 nt . 
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- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 56: at 26 aa . 



(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 56: from 29 to 92 aa . 



(Dp) Related Amino Acid Sequences 
10 - Alignment No. 37 

- gi No. 543565 

- Description: 

- % Identity: 56.5 

- Alignment Length: 85 

15 - Location of Alignment in SEQ ID NO 56: from 13 to 

94 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 57 
20 - Ceres seq_id 1011787 

- Location of start within SEQ ID NO 54: at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
25 (Dp) Related Amino Acid Sequences 



Maximum Length Sequence: 

related to: 
Clone IDs: 
30 41992 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 58 

- Ceres seq__id 1011820 

- Alternative transcription start site(s) located in SEQ 
35 ID NO 58: 

-40, 37 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 59 
40 - Ceres seq_id 1011821 

- Location of start within SEQ ID NO 58: at 1 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
45 (Dp) Related Amino Acid Sequences 

- Alignment No. 3 8 

- gi No. 3417418 

- Description: 

- % Identity: 23.6 

50 - Alignment Length: 207 
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- Location of Alignment in SEQ ID NO 59: from 24 to 

226 



(B) Polypeptide Sequence 
5 - Pat. Appln. SEQ ID NO 60 

- Ceres seq__id 1011822 

- Location of start within SEQ ID NO 58: at 13 nt . 



(C) Nomination and Annotation of Domains within 
10 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 3 9 

- gi No. 3417418 

- Description: 

15 - % Identity: 23.6 

- Alignment Length: 207 

- Location of Alignment in SEQ ID NO 60: from 20 to 



20 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 61 

- Ceres seq_id 1011823 

- Location of start within SEQ ID NO 58: at 151 nt . 



25 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 40 

- gi No. 3417418 
30 - Description: 

- % Identity: 23 . 6 

- Alignment Length: 207 

- Location of Alignment in SEQ ID NO 61: from 1 to 

17 6 

35 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

specific gene. 

4 0 

Maximum Length Sequence: 

related to: 
Clone IDs: 
41851 

45 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 62 

- Ceres seq_id 1011874 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 63 
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- Ceres seq_id 1011875 

- Location of start within SEQ ID NO 62: at 1 nt . 

(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 41 

- gi No. 2911044 

- Description: 

10 - % Identity: 78.7 

- Alignment Length: 95 

- Location of Alignment in SEQ ID NO 63: from 28 to 121 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
15 dicot specific gene, plant specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
20 41682 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 64 

- Ceres seq_id 1011981 



25 



30 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 65 

- Ceres seq_id 1011982 

- Location of start within SEQ ID NO 64: at 68 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 42 
35 - gi No. 4115355 

- Description: 

- % Identity: 100 

- Alignment Length: 52 

- Location of Alignment in SEQ ID NO 65: from 1 to 

4 0 52 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 66 

- Ceres seq_id 1011983 

45 - Location of start within SEQ ID NO 64 : at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 



50 



(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 67 

- Ceres seq_id 1011984 

- Location of start within SEQ ID NO 64: at 483 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
5 ID NO 67 : at 19 aa. 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

10 

Maximum Length Sequence: 

related to: 
Clone IDs: 
15 38470 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 68 

- Ceres seq_id 1014547 

- Alternative transcription start site(s) located in SEQ 
20 ID NO 68: 

-39,-2,-1,2, 3,7,8,9,10 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 69 
25 - Ceres seq_id 1014548 

- Location of start within SEQ ID NO 68 : at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 69: at 32 aa. 

30 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 69: from 44 to 110 aa . 

35 (Dp) Related Amino Acid Sequences 

- Alignment No. 43 

- gi No. 3062791 

- Description: 

- % Identity: 72.2 

40 - Alignment Length: 90 

- Location of Alignment in SEQ ID NO 69: from 21 to 

110 

(B) Polypeptide Sequence 
45 - Pat. Appln. SEQ ID NO 70 

- Ceres seq_id 1014549 

- Location of start within SEQ ID NO 68: at 25 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 70: at 24 aa. 

50 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 70: from 36 to 102 aa . 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 4 4 

- gi No. 3062791 

- Description: 

10 - % Identity: 72.2 

- Alignment Length: 90 

- Location of Alignment in SEQ ID NO 70: from 13 to 

102 

15 Maximum Length Sequence: 
related to: 
Clone IDs : 
38004 

(Ac) cDNA Polynucleotide Sequence 
20 - Pat. Appln. SEQ ID NO 71 

- Ceres seq_id 1014995 

- Alternative transcription start site(s) located in SEQ 
ID NO 71: 

2,3, 4 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 7 2 

- Ceres seq_id 1014996 

- Location of start within SEQ ID NO 71: at 2 nt . 

30 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Aminotransferase class IV 

- Location within SEQ ID NO 72: from 71 to 334 aa. 

35 

(Dp) Related Amino Acid Sequences 

- Alignment No. 4 5 

- gi No. 3540183 

- Description: 

40 - % Identity: 54.7 

- Alignment Length: 287 

- Location of Alignment in SEQ ID NO 72: from 56 to 

341 

45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 7 3 

- Ceres seq_id 1014997 

- Location of start within SEQ ID NO 71: at 65 nt . 

50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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- Aminotransferase class IV 

- Location within SEQ ID NO 73: from 50 to 313 aa. 

(Dp) Related Amino Acid Sequences 
5 - Alignment No. 4 6 

- gi No. 3540183 

- Description: 

- % Identity: 54.7 

- Alignment Length: 287 

10 - Location of Alignment in SEQ ID NO 73: from 35 to 

320 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 74 
15 - Ceres seq_id 1014998 

- Location of start within SEQ ID NO 71: at 167 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
20 - Aminotransferase class IV 

- Location within SEQ ID NO 74: from 16 to 279 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 47 
25 - gi No. 3540183 

- Description: 

~ % Identity: 54.7 

- Alignment Length: 287 

- Location of Alignment in SEQ ID NO 74: from 1 to 

30 286 

Maximum Length Sequence: 

related to: 
Clone IDs: 
35 37701 

Public Genomic DNA: 
gi No: 4699904 
Predicted Exons: 

SINGLE 38530 ... 37988 OCKHAMG-CDS 
40 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 75 

- Ceres seq_id 1015323 

- Alternative transcription start site(s) located in SEQ 
ID NO 75: 

45 -1,5,6,12 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 76 

- Ceres seq__id 1015324 

50 - Location of start within SEQ ID NO 75: at 59 nt . 
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- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 76: at 25 aa . 

(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 4 8 

- gi No. 3860308 

- Description: 

10 - % Identity: 44.5 

- Alignment Length: 140 

- Location of Alignment in SEQ ID NO 76: from 56 to 

175 

15 (Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

specific gene. 

2 0 Maximum Length Sequence: 
related to: 
Clone IDs: 
364 

(Ac) cDNA Polynucleotide Sequence 
25 - Pat. Appln. SEQ ID NO 77 

- Ceres seq_id 1016486 

- Alternative transcription start site(s) located in SEQ 
ID NO 77 : 

17, 19, 20, 21, 22, 23, 29, 35, 38 

30 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 78 

- Ceres seq__id 1016487 

- Location of start within SEQ ID NO 77: at 1 nt . 

35 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 78 : at 46 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 - Gamma-thionins family 

- Location within SEQ ID NO 78: from 55 to 101 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 4 9 
45 - gi No. 4038039 

- Description: 

- % Identity: 100 

- Alignment Length: 77 

- Location of Alignment in SEQ ID NO 78: from 25 to 

50 101 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 79 

- Ceres seq_id 1016488 

- Location of start within SEQ ID NO 77: at 73 nt . 

5 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 79: at 22 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
10 - Gamma-thionins family 

- Location within SEQ ID NO 79: from 31 to 77 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 50 
15 - gi No. 4038039 

- Description: 

- % Identity: 100 

- Alignment Length: 77 

- Location of Alignment in SEQ ID NO 79: from 1 to 

20 77 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 80 

- Ceres seq_id 1016489 

25 - Location of start within SEQ ID NO 77: at 118 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Gamma-thionins family 

30 - Location within SEQ ID NO 80: from 16 to 62 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 51 

- gi No. 4038039 
35 - Description: 

- % Identity: 100 

- Alignment Length: 77 

- Location of Alignment in SEQ ID NO 80: from 1 to 

62 

40 

Maximum Length Sequence: 

related to: 
Clone IDs: 
33891 

45 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 81 

- Ceres seq_id 1018341 

- Alternative transcription start site(s) located in SEQ 
ID NO 81: 

50 4 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 82 

- Ceres seq_id 1018342 

- Location of start within SEQ ID NO 81: at 71 nt . 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 52 
10 - gi No. 1064887 

- Description: 

- % Identity: 60 

- Alignment Length: 65 

- Location of Alignment in SEQ ID NO 82: from 2 to 

15 66 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 8 3 

- Ceres seq_id 1018343 

20 - Location of start within SEQ ID NO 81: at 143 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 53 

- gi No. 1064887 

- Description: 

- % Identity: 60 

- Alignment Length: 65 

30 - Location of Alignment in SEQ ID NO 83: from 1 to 

42 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 8 4 
35 - Ceres seq_id 1018344 

- Location of start within SEQ ID NO 81: at 146 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 (Dp) Related Amino Acid Sequences 

- Alignment No . 5 4 

- gi No. 1064887 

- Description: 

- % Identity: 60 

4 5 - Alignment Length: 65 

- Location of Alignment in SEQ ID NO 84: from 1 to 

41 



(Ba) Polypeptide Activities: Similar to pollen coat protein 
50 activities and LEA protein activities. 
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Maximum Length Sequence: 

related to: 
Clone IDs: 
33828 

5 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 85 

- Ceres seq_id 1018382 

- Alternative transcription start site(s) located in SEQ 
ID NO 85: 

10 2, 4, 5, 6, 7, 8, 9, 10, 11, 14 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 8 6 

- Ceres seq_id 1018383 

15 - Location of start within SEQ ID NO 85: at 22 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 86: at 24 aa. 

(C) Nomination and Annotation of Domains within 
20 Predicted Polypeptide ( s ) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 86: from 28 to 115 aa. 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 55 

- gi No. 899224 

- Description: 

- % Identity: 78.2 

- Alignment Length: 119 

30 - Location of Alignment in SEQ ID NO 86: from 1 to 

119 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 8 7 
35 - Ceres seq_id 1018384 

- Location of start within SEQ ID NO 85: at 73 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 - Plant lipid transfer protein family 

- Location within SEQ ID NO 87: from 11 to 98 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 56 
45 - gi No. 899224 

- Description: 

- % Identity: 78.2 

- Alignment Length: 119 

- Location of Alignment in SEQ ID NO 87: from 1 to 

50 102 



WO 00/40695 



PCT/US00/00466 



137 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 88 

- Ceres seq_id 1018385 

- Location of start within SEQ ID NO 85: at 3 nt . 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 



10 Maximum Length Sequence: 
related to: 
Clone IDs: 
30349 

(Ac) cDNA Polynucleotide Sequence 
15 - Pat. Appln. SEQ ID NO 89 

- Ceres seq_id 1020666 

- Alternative transcription start site(s) located in SEQ 
ID NO 89: 

33,35,39,40,42,43,44,45,64,173 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 90 

- Ceres seq_id 1020667 

- Location of start within SEQ ID NO 89: at 118 nt . 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S7e 

- Location within SEQ ID NO 90: from 7 to 187 aa. 

30 

(Dp) Related Amino Acid Sequences 

- Alignment No. 57 

- gi No. 3851636 

- Description: 

35 - % Identity: 77.4 

- Alignment Length: 190 

- Location of Alignment in SEQ ID NO 90: from 1 to 



40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 91 

- Ceres seq_id 1020668 

- Location of start within SEQ ID NO 89: at 271 nt . 

45 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Ribosomal protein S7e 

- Location within SEQ ID NO 91: from 1 to 136 aa. 



50 



(Dp) Related Amino Acid Sequences 
- Alignment No. 58 
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- gi No. 3851636 

- Description: 

- % Identity: 77.4 

- Alignment Length: 190 

5 - Location of Alignment in SEQ ID NO 91: from 1 to 

139 

Maximum Length Sequence: 
related to: 
10 Clone IDs: 
30113 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 92 

- Ceres seq_id 1020784 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 93 

- Ceres seq__id 1020785 

20 - Location of start within SEQ ID NO 92: at 60 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 93: at 25 aa. 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 59 

- gi No. 3860308 

- Description: 

30 - % Identity: 44.5 

- Alignment Length: 14 0 

- Location of Alignment in SEQ ID NO 93: from 56 to 

175 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 



4 0 Maximum Length Sequence: 
related to: 
Clone IDs: 

29120 
Public Genomic DNA: 
45 gi No: 5822667 

Predicted Exons : 

INTR 68772 . . . 69532 OCKHAMG-CDNA 

SINGLE 68846 ... 69325 OCKHAMG-CDS 
50 gi No: 6041831 

Predicted Exons: 
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INTR 63702 ... 64462 OCKHAMG-CDNA 

SINGLE 63776 ... 64255 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 94 

- Ceres seq_id 1021525 

- Alternative transcription start site(s) located in SEQ 
ID NO 94: 

25,26,27,28,29,35,36,39,51,53,54,68 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 95 

- Ceres seq_id 1021526 

- Location of start within SEQ ID NO 94: at 75 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 60 

- gi No. 4388980 

- Description: 

- % Identity: 29.1 

- Alignment Length: 110 

- Location of Alignment in SEQ ID NO 95: from 47 to 

156 

(Ba) Polypeptide Activities: Similar to adrenodoxi precursor 
protein activities and adrenal ferredoxin activities. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
2891 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 96 

- Ceres seq_id 1021563 

- Alternative transcription start site(s) located in SEQ 
ID NO 96: 

16, 28, 29, 30, 31, 35, 36, 43, 74, 77, 80, 88, 89, 90, 95 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 97 

- Ceres seq_id 1021564 

- Location of start within SEQ ID NO 96: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
- Alignment No. 61 
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- gi No. 2493089 

- Description: 

- % Identity: 29.7 

- Alignment Length: 77 

5 - Location of Alignment in SEQ ID NO 97: from 52 to 

125 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 98 
10 - Ceres seq_id 1021565 

- Location of start within SEQ ID NO 96: at 116 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 (Dp) Related Amino Acid Sequences 

- Alignment No. 62 

- gi No. 2493089 

- Description: 

- % Identity: 29.7 

20 - Alignment Length: 77 

- Location of Alignment in SEQ ID NO 98: from 14 to 

87 

(Ba) Polypeptide Activities: Similar to ATPK-mouse 

25 activities, ATP synthase activities, and mitochondrial F- 
Chain activities. 

Maximum Length Sequence: 
related to: 
30 Clone IDs: 

28979 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 99 

- Ceres seq_id 1021576 

35 - Alternative transcription start site(s) located in SEQ 

ID NO 99: 

-13,- 

11, 3, 5, 7, 11, 12, 13, 15, 24, 26, 28, 29, 31, 34, 35, 36, 37, 38, 39 

40 43, 44, 45, 46, 47, 48, 50, 51, 52, 56, 58, 59, 61, 67, 68, 70, 75, 82, 83, 91, 1 

12 

244, 313, 318 

(B) Polypeptide Sequence 
45 - Pat. Appln. SEQ ID NO 100 

- Ceres seq__id 1021577 

- Location of start within SEQ ID NO 99: at 109 nt . 

(C) Nomination and Annotation of Domains within 
50 Predicted Polypeptide ( s ) 

- Photosystem I psaG / psaK 
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- Location within SEQ ID NO 100: from 50 to 130 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 63 

- gi No. 3885511 

- Description: 

- % Identity: 81.1 

- Alignment Length: 128 

- Location of Alignment in SEQ ID NO 100: from 1 to 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 101 

- Ceres seq_id 1021578 

- Location of start within SEQ ID NO 99: at 121 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Photosystem I psaG / psaK 

- Location within SEQ ID NO 101: from 46 to 126 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 64 

- gi No. 3885511 

- Description: 

- % Identity: 81.1 

- Alignment Length: 128 

- Location of Alignment in SEQ ID NO 101: from 1 to 

123 

i 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 102 

- Ceres seq_id 1021579 

- Location of start within SEQ ID NO 99: at 124 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Photosystem I psaG / psaK 

- Location within SEQ ID NO 102: from 45 to 125 aa. 

3 

(Dp) Related Amino Acid Sequences 

- Alignment No. 65 

- gi No. 3885511 

- Description: 

5 - % Identity: 81.1 

- Alignment Length: 128 

- Location of Alignment in SEQ ID NO 102: from 1 to 

122 

0 Maximum Length Sequence: 
related to: 
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28177 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 103 

- Ceres seq_id 1021927 

- Alternative transcription start site(s) located in SEQ 
ID NO 103: 

2,12,20,23,29,36,46,47,53 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 104 

- Ceres seq_id 1021928 

- Location of start within SEQ ID NO 103: at 67 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 66 

- gi No. 4263779 

- Description: 

- % Identity: 28.2 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 104: from 14 

to 182 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 105 

- Ceres seq_id 1021929 

- Location of starr within SEQ ID NO 103: at 172 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 105: at 23 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 67 

- gi No. 4263779 

- Description: 

- % Identity: 28.2 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 105: from 1 to 

147 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 106 

- Ceres seq_id 1021930 

- Location of start within SEQ ID NO 103: at 220 nt . 



0 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
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- Alignment No. 68 

- gi No. 4263779 

- Description: 

- % Identity: 28.2 

5 - Alignment Length: 177 

- Location of Alignment in SEQ ID NO 106: from 1 to 

131 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
10 dicot specific gene, plant specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
15 2807 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 107 

- Ceres seq_id 1021945 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 108 

- Ceres seq__id 1021946 

- Location of start within SEQ ID NO 107: at 1 nt . 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 6 9 
30 - gi No. 3334132 

- Description: 

- % Identity: 27.4 

- Alignment Length: 114 

- Location of Alignment in SEQ ID NO 108: from 28 

35 to 136 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 109 

- Ceres seq_id 1021947 

40 - Location of start within SEQ ID NO 107: at 163 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
45 - Alignment No. 70 

- gi No. 3334132 

- Description: 

~ % Identity: 27.4 

- Alignment Length: 114 

50 - Location of Alignment in SEQ ID NO 109: from 1 to 

82 
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(Ba) Polypeptide Activities: Similar to C21 ORF4 - membrane 
protein activities 

5 Maximum Length Sequence: 
related to: 
Clone IDs: 
27792 

(Ac) cDNA Polynucleotide Sequence 
10 - Pat. Appln. SEQ ID NO 110 

- Ceres seq_id 1022170 

- Alternative transcription start site(s) located in SEQ 
ID NO 110: 

-4, -1, 32, 68 

15 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 111 

- Ceres seq_id 1022171 

- Location of start within SEQ ID NO 110: at 92 nt . 

20 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 71 
25 - gi No. 1173456 

- Description: 

- % Identity: 54.7 

- Alignment Length: 12 9 

- Location of Alignment in SEQ ID NO 111: from 4 to 

30 131 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 112 

- Ceres seq_id 1022172 

35 - Location of start within SEQ ID NO 110: at 191 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
40 - Alignment No. 72 

- gi No. 1173456 

- Description: 

- % Identity: 54.7 

- Alignment Length: 12 9 

45 - Location of Alignment in SEQ ID NO 112: from 1 to 

98 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 113 
50 - Ceres seq_id 1022173 

- Location of start within SEQ ID NO 110: at 1 nt . 
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- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 113: at 19 aa . 

(Ba) Polypeptide Activities: Similar to small nuclear 
r ibonucleoprotein activities . 

Maximum Length Sequence: 

related to: 
Clone IDs: 

27167 
Public Genomic DNA: 

gi No: 3046850 

Predicted Exons : 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 114 

- Ceres seq_id 1022554 

- Alternative transcription start site(s) located in SEQ 
ID NO 114 : 

-49, - 

19,7,22,23,28,29,30,31, 32 ,33,35, 36, 37 ,51,57, 67 ,75,81,82 

85 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 115 

- Ceres seq_id 1022555 

- Location of start within SEQ ID NO 114: at 107 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Heme-binding domain in cytochrome b5 and 
oxidoreductases 

- Location within SEQ ID NO 115: from 7 to 84 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 73 

- gi No. 4240122 

- Description: 

- % Identity: 100 

- Alignment Length: 140 

- Location of Alignment in SEQ ID NO 115: from 1 to 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 116 

- Ceres seq_id 1022556 

- Location of start within SEQ ID NO 114: at 317 nt . 



INIT 
INTR 
TERM 



45217 
44695 
44554 



45131 
44629 
44286 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



140 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 



- Alignment No. 7 4 

- gi No. 4240122 

- Description: 

- % Identity: 100 

- Alignment Length: 140 

- Location of Alignment in SEQ ID NO 116: from 1 to 



Maximum Length Sequence: 
related to: 

Clone IDs: 
27109 

Public Genomic DNA: 
gi No: 6449507 
Predicted Exons : 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 117 

- Ceres seq_id 1022594 

- Alternative transcription start site(s) located in SEQ 
ID NO 117 : 

2, 15,24,25, 66, 69,72,74 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 118 

- Ceres seq__id 1022595 

- Location of start within SEQ ID NO 117: at i nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Universal stress protein family 

- Location within SEQ ID NO 118: from 98 to 183 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 7 5 

- gi No. 2160182 

- Description: 

- % Identity: 37.5 

- Alignment Length: 160 

- Location of Alignment in SEQ ID NO 118: from 39 



INIT 
INTR 
INTR 
TERM 



94711 
94417 
94249 
94046 



94519 
94326 
94131 
93968 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



to 186 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 119 

- Ceres seq_id 1022596 
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- Location of start within SEQ ID NO 117: at 82 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
5 - Universal stress protein family 

- Location within SEQ ID NO 119: from 71 to 156 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 7 6 
10 - gi No. 2160182 

- Description: 

- % Identity: 37.5 

- Alignment Length: 160 

- Location of Alignment in SEQ ID NO 119: from 12 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 120 

- Ceres seq_id 1022597 

20 - Location of start within SEQ ID NO 117: at 106 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Universal stress protein family 
25 - Location within SEQ ID NO 120: from 63 to 148 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 77 

- gi No. 2160182 
30 - Description: 

- % Identity: 37.5 

- Alignment Length: 160 

- Location of Alignment in SEQ ID NO 120: from 4 to 

151 

35 

(Ba) Polypeptide Activities: Similar to protein in 
methanobacterium thermoautot rophicum activities . 

Maximum Length Sequence: 
40 related to: 

Clone IDs: 

26994 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 121 
45 - Ceres seq_id 1022621 

- Alternative transcription start site(s) located in SEQ 
ID NO 121: 

2,7,9, 13, 35, 38, 45, 57 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 122 
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- Ceres seq_id 1022622 

- Location of start within SEQ ID NO 121: at 86 nt . 

(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 7 8 

- gi No. 3256599 

- Description: 

10 - % Identity: 32 

- Alignment Length: 128 

- Location of Alignment in SEQ ID NO 122: from 7 to 

128 

15 (Ba) Polypeptide Activities: Similar to structural cell wall 
protein activities, and larval gene protein in the Fruit fly 
activities . 



20 Maximum Length Sequence: 
related to: 
Clone IDs: 
23518 

(Ac) cDNA Polynucleotide Sequence 
25 - Pat. Appln. SEQ ID NO 123 

- Ceres seq_id 1024375 

- Alternative transcription start site(s) located in SEQ 
ID NO 123: 

7, 8, 10, 17 ,18, 24,35, 41, 42, 43, 44, 46, 48, 52, 54, 59 



30 



35 



40 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 124 

- Ceres seq_id 1024376 

- Location of start within SEQ ID NO 123: at 130 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S7e 

- Location within SEQ ID NO 124: from 7 to 187 aa , 



(Dp) Related Amino Acid Sequences 

- Alignment No. 7 9 

- gi No. 3851636 

- Description: 

45 - % Identity: 76.6 

- Alignment Length: 188 

- Location of Alignment in SEQ ID NO 124: from 1 to 

188 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 125 
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- Ceres seq_id 1024377 

- Location of start within SEQ ID NO 123: at 283 nt 



(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

- Ribosomal protein S7e 

- Location within SEQ ID NO 125: from 1 to 136 aa . 



(Dp) Related Amino Acid Sequences 
10 - Alignment No. 80 

- gi No. 3851636 

- Description: 

- % Identity: 76.6 

- Alignment Length: 188 

15 - Location of Alignment in SEQ ID NO 125: from 1 to 

137 



Maximum Length Sequence: 
related to: 
20 Clone IDs: 
23170 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 126 

- Ceres seq_id 1024535 

25 - Alternative transcription start site(s) located in SEQ 

ID NO 126: 

2, 8, 11, 31, 46, 47, 48 



(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 127 

- Ceres seq__id 1024536 

- Location of start within SEQ ID NO 126: at 115 nt. 



(C) Nomination and Annotation of Domains within 
35 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 81 

- gi No. 2621731 

- Description: 

40 - % Identity: 35.2 

- Alignment Length: 88 

~ Location of Alignment in SEQ ID NO 127: from 5 to 



45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 128 

- Ceres seq_id 1024537 

- Location of start within SEQ ID NO 126: at 3 nt . 

50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 129 
5 - Ceres seq_id 1024538 

- Location of start within SEQ ID NO 126: at 253 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
10 (Dp) Related Amino Acid Sequences 

- Alignment No. 82 

- gi No. 2621731 

- Description: 

- % Identity: 35.2 

15 - Alignment Length: 88 

- Location of Alignment in SEQ ID NO 129: from 1 to 

46 

(Ba) Polypeptide Activities: Similar to small nuclear 
20 ribonucleoprotein activities. 



Maximum Length Sequence: 
related to: 
25 Clone IDs: 
21228 
Public Genomic DNA: 
gi No: 4539402 
Predicted Exons : 



INIT 


36726 . . . 


36352 


GENBANK 




INTR 


36257 . . . 


36135 


GEN BANK 




TERM 


35555 . . . 


35469 


GENBANK 




INTR 


36791 ... 


36352 


OCKHAMG- 


CDNA 


INTR 


36257 . . . 


36135 


OCKHAMG- 


CDNA 


INTR 


35555 . . . 


35325 


OCKHAMG- 


CDNA 


INIT 


36726 . . . 


36352 


OCKHAMG- 


-CDS 


INTR 


36257 . . . 


36135 


OCKHAMG- 


CDS 


TERM 


35555 . . . 


35469 


OCKHAMG- 


•CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 130 

- Ceres seq_id 1025683 

- Alternative transcription start site(s) located in SEQ 
45 ID NO 130: 

2, 8,22 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 131 
50 - Ceres seq_id 1025684 

- Location of start within SEQ ID NO 130: at 3 nt. 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L6 
5 - Location within SEQ ID NO 131: from 34 to 216 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 8 3 

- gi No. 266945 
10 - Description: 

- % Identity: 84 

- Alignment Length: 194 

- Location of Alignment in SEQ ID NO 131: from 23 

to 216 

15 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 132 

- Ceres seq_id 1025685 

- Location of start within SEQ ID NO 130: at 69 nt . 

20 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L6 

- Location within SEQ ID NO 132: from 12 to 194 aa. 

25 

(Dp) Related Amino Acid Sequences 

- Alignment No. 8 4 

- gi No. 266945 

- Description: 

30 - % Identity: 84 

- Alignment Length: 194 

- Location of Alignment in SEQ ID NO 132: from 1 to 

194 



35 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 133 

- Ceres seq_id 1025686 

- Location of start within SEQ ID NO 130: at 96 nt . 

40 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Ribosomal protein L6 

- Location within SEQ ID NO 133: from 3 to 185 aa. 



45 (Dp) Related Amino Acid Sequences 

- Alignment No. 85 

- gi No. 266945 

- Description: 

- % Identity: 84 

50 - Alignment Length: 194 
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- Location of Alignment in SEQ ID NO 133: from 1 to 

185 

Maximum Length Sequence: 
5 related to: 

Clone IDs: 
19274 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 134 
10 - Ceres seq_id 1027152 

- Alternative transcription start site(s) located in SEQ 
ID NO 13 4 : 

-350, 3, 4, 10, 11, 13, 222 

15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 135 

- Ceres seq_id 1027153 

- Location of start within SEQ ID NO 134: at 3 nt . 

20 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 8 6 

- gi No. 2879811 
25 - Description: 

- % Identity: 86.6 

- Alignment Length: 112 

- Location of Alignment in SJH ID NO 135: from 26 

to 137 



30 



35 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 136 

- Ceres seq_id 1027154 

- Location of start within SEQ ID NO 134: at 78 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 87 
40 - gi No. 2879811 

- Description: 

- % Identity: 86.6 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 136: from 1 to 

45 112 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 137 

- Ceres seq_id 1027155 

50 - Location of start within SEQ ID NO 134: at 141 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 88 
5 - gi No. 2879811 

- Description: 

- % Identity: 86.6 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 137: from 1 to 

10 91 



(Ba) Polypeptide Activities: Similar to ribosomal L30 
protein activities . 

15 

Maximum Length Sequence: 

related to: 
Clone IDs: 
17835 

20 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 138 

- Ceres seq__id 1028095 

- Alternative transcription start site(s) located in SEQ 
ID NO 138: 

25 -2, 2, 3,4,5, 6, 12 ,14,18, 22, 26, 40, 42,44, 45,46, 47 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 13 9 

- Ceres seq_id 1028096 

30 - Location of start within SEQ ID NO 138: at 55 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 139: at 29 aa . 



(C) Nomination and Annotation of Domains within 
35 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 8 9 

- gi No. 4336325 

- Description: 

40 - % Identity: 31.7 

- Alignment Length: 12 6 

- Location of Alignment in SEQ ID NO 139: from 15 



45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 140 

- Ceres seq_id 1028097 

- Location of start within SEQ ID NO 138: at 214 nt . 



50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 90 

- gi No. 4336325 

- Description: 

5 - % Identity: 31.7 

- Alignment Length: 126 

- Location of Alignment in SEQ ID NO 140: from 1 to 



10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 141 

- Ceres seq_id 1028098 

- Location of start within SEQ ID NO 138: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
15 ID NO 141: at 16 aa . 



(Ba) Polypeptide Activities: Similar to human C214 membrane 
protein activities . 



20 



Maximum Length Sequence: 

related to: 
Clone IDs: 
17075 

25 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 142 

- Ceres seq__id 1028608 

- Alternative transcription start site(s) located in SEQ 
ID NO 142 : 

30 -4,2,28,31,36,49,59 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 143 

- Ceres seq_id 1028609 

35 - Location of start within SEQ ID NO 142: at 95 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
40 - Alignment No. 91 

- gi No. 2735528 

- Description: 

- % Identity: 33.9 

- Alignment Length: 118 

45 - Location of Alignment in SEQ ID NO 143: from 64 

to 178 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 144 
50 - Ceres seq__id 1028610 

- Location of start within SEQ ID NO 142: at 176 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
5 - Alignment No. 92 

- gi No. 2735528 

- Description: 

- % Identity: 33.9 

- Alignment Length: 118 

10 - Location of Alignment in SEQ ID NO 144: from 37 

to 151 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 145 
15 - Ceres seq_id 1028611 

- Location of start within SEQ ID NO 142: at 381 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 145: at 41 aa . 

20 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

Maximum Length Sequence: 
25 related to: 

Clone IDs: 
1505 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 146 
30 - Ceres seq_id 1030069 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID MO 14 7 
35 - Ceres seq_id 1030070 

- Location of start within SEQ ID NO 146: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 (Dp) Related Amino Acid Sequences 

- Alignment No. 93 

- gi No. 4335755 

- Description: 

- % Identity: 63.6 

45 - Alignment Length: 143 

- Location of Alignment in SEQ ID NO 147: from 34 

to 171 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 148 

- Ceres seq_id 1030071 
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- Location of start within SEQ ID NO 146: at 50 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
5 (Dp) Related Amino Acid Sequences 

- Alignment No. 94 

- gi No. 4335755 

- Description: 

- % Identity: 63.6 

10 - Alignment Length: 14 3 

- Location of Alignment in SEQ ID NO 148: from 18 

to 155 



(B) Polypeptide Sequence 
15 - Pat. Appln. SEQ ID NO 149 

- Ceres seq_id 1030072 

- Location of start within SEQ ID NO 146: at 170 nt . 



(C) Nomination and Annotation of Domains within 
20 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 95 

- gi No. 4335755 

- Description: 

25 - % Identity: 63.6 

- Alignment Length: 143 

- Location of Alignment in SEQ ID NO 149: from 1 to 

115 



30 (Ba) Polypeptide Activities: Similar to hydroxyproline-rich 
protein activities . 



Maximum Length Sequence: 
35 related to: 
Clone IDs: 
12487 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 150 
40 - Ceres seq_id 1032069 

- Alternative transcription start site(s) located in SEQ 
ID NO 150 : 



45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 151 

- Ceres seq_id 1032070 

- Location of start within SEQ ID NO 150: at 74 nt . 



50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 9 6 

- gi No. 3386621 

- Description: 

5 - % Identity: 97.4 

- Alignment Length: 234 

- Location of Alignment in SEQ ID NO 151: from 1 to 

231 

10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 152 

- Ceres seq_id 1032071 

- Location of start within SEQ ID NO 150: at 122 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 97 

- gi No. 3386621 
20 - Description: 

- % Identity: 97.4 

- Alignment Length: 234 

- Location of Alignment in SEQ ID NO 152: from 1 to 

215 

25 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

Maximum Length Sequence: 
30 related to: 

Clone IDs: 
11466 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 153 
35 - Ceres seq_id 1033557 

- Alternative transcription start site(s) located in SEQ 
ID NO 153: 

62, 64, 65, 67, 72, 73, 74, 75, 166 

40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 154 

- Ceres seq_id 1033558 

- Location of start within SEQ ID NO 153: at 94 nt . 

45 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Ribosomal protein L34 

- Location within SEQ ID NO 154: from 105 to 145 

aa . 

50 

(Dp) Related Amino Acid Sequences 
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- Alignment No, 98 

- gi No. 132909 

- Description: 

- % Identity: 70 

5 - Alignment Length: 30 

- Location of Alignment in SEQ ID NO 154: from 116 

to 145 

(Ba) Polypeptide Activities: Similar to 50S ribosomal 
10 protein L34 activities. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
15 21589 
106951 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 155 

- Ceres seq_id 1034688 

20 - Alternative transcription start site(s) located in SEQ 

ID NO 155: 
2 

- Clone 21589 starts at 2 and ends at in cDNA. 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 156 

- Ceres seq_id 1034689 

- Location of start within SEQ ID NO 155: at 2 nt . 

30 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 99 

- gi No. 4335755 
35 - Description: 

- % Identity: 65 

- Alignment Length: 143 

- Location of Alignment in SEQ ID NO 156: from 33 

to 170 

40 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 157 

- Ceres seq_id 1034690 

- Location of start within SEQ ID NO 155: at 47 nt . 

45 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 100 
50 - gi No. 4335755 

- Description: 
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- % Identity: 65 

- Alignment Length: 143 

- Location of Alignment in SEQ ID NO 157: from 18 

to 155 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 158 

- Ceres seq_id 1034691 

- Location of start within SEQ ID NO 155: at 167 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 101 

- gi No. 4335755 

- Description: 

- % Identity: 65 

- Alignment Length: 143 

- Location of Alignment in SEQ ID NO 158: from 1 to 

115 

(Ba) Polypeptide Activities: Similar to hydroxy proline rich 
glycoprotein activities . 

Maximum Length Sequence: 

related to: 
Clone IDs: 
10433 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 159 

- Ceres seq__id 1035033 

- Alternative transcription start site(s) located in SEQ 
ID NO 159: 

32 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 160 

- Ceres seq_id 1035034 

- Location of start within SEQ ID NO 159: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 160: at 47 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 102 

- gi No. 3062795 

- Description: 

- % Identity: 39.8 

- Alignment Length: 83 
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10 



160 

- Location of Alignment in SEQ ID NO 160: from 2 4 

to 99 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 161 

- Ceres seq_id 1035035 

- Location of start within SEQ ID NO 159: at 69 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 161: at 25 aa . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 103 
15 - gi No. 3062795 

- Description: 

- % Identity: 39.8 

- Alignment Length: 83 

- Location of Alignment in SEQ ID NO 161: from 2 to 

20 77 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 162 

- Ceres seq_id 1035036 

25 - Location of start within SEQ ID NO 159: at 72 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 162: at 24 aa. 

(C) Nomination and Annotation of Domains within 
30 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 104 

- gi No. 3062795 

- Description: 

35 - % Identity: 39.8 

- Alignment Length: 8 3 

- Location of Alignment in SEQ ID NO 162: from 1 to 

7 6 

40 (Ba) Polypeptide Activities: Similar to Pollen coat protein 
activities . 



Maximum Length Sequence: 
45 related to: 

Clone IDs: 
10394 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 163 
50 - Ceres seq_id 1035071 
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- Alternative transcription start site(s) located in SEQ 
ID NO 163: 

-2,-1,2,3, 18,19,21,24,25,56,66 

5 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 164 

- Ceres seq_id 1035072 

- Location of start within SEQ ID NO 163: at 66 nt . 

10 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- KH domain 

- Location within SEQ ID NO 164: from 47 to 95 aa . 

15 (Dp) Related Amino Acid Sequences 

- Alignment No. 105 

- gi No. 133940 

- Description: 

- % Identity: 76 

20 - Alignment Length: 246 

- Location of Alignment in SEQ ID NO 164: from 1 to 

148 

(B) Polypeptide Sequence 
25 - Pat. Appln. SEQ ID NO 165 

- Ceres seq_id 1035073 

- Location of start within SEQ ID NO 163: at 195 nt . 

(C) Nomination and Annotation of Domains within 
30 Predicted Polypeptide ( s ) 

- KH domain 

- Location within SEQ ID NO 165: from 4 to 52 aa . 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 106 

- gi No. 133940 

- Description: 

- % Identity: 76 

- Alignment Length: 246 

40 - Location of Alignment in SEQ ID NO 165: from 1 to 

105 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 166 
45 - Ceres seq_id 1035074 

- Location of start within SEQ ID NO 163: at 513 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
50 (Dp) Related Amino Acid Sequences 

- Alignment No. 107 
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- gi No. 133940 

- Description: 

- % Identity: 76 

- Alignment Length: 24 6 

5 - Location of Alignment in SEQ ID NO 166: from 1 to 

97 



Maximum Length Sequence: 
related to: 
10 Clone IDs: 
10511 
Public Genomic DNA: 
gi No: 4539290 
Predicted Exons : 
15 INTR 5588 . . . 5314 OCKHAMG-CDNA 



INTR 5517 . . . 5314 OCKHAMG-CDNA 
gi No: 4914454 
Predicted Exons: 
20 INTR 38937 . . . 38663 OCKHAMG-CDNA 



INTR 38866 ... 38663 OCKHAMG-CDNA 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 167 
25 - Ceres seq_id 1376587 

- Alternative transcription start site(s) located in SEQ 
ID NO 167 : 

8,9,10,12,13,14,15,16,17,30,34,39,41 



30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 168 

- Ceres seq__id 1376588 

- Location of start within SEQ ID NO 167: at 2 nt . 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 108 

- gi No. 4539292 
40 - Description: 

- % Identity: 99.4 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 168: from 26 

to 202 

45 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 169 

- Ceres seq_id 1376589 

- Location of start within SEQ ID NO 167: at 77 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 109 

- gi No. 4539292 

- Description: 

- % Identity: 99.4 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 169: from 1 to 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 170 

- Ceres seq_id 1376590 

- Location of start within SEQ ID NO 167: at 212 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 110 

- gi No. 4539292 

- Description: 

- % Identity: 99.4 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 170: from 1 to 



(Ba) Polypeptide Activities: Similar 40S ribosomal protein 
activities . 



Maximum Length Sequence: 

related to: 
Clone IDs: 

112110 
Public Genomic DNA : 
gi No: 4263774 
Predicted Exons : 
INIT 1816 
INTR 1290 
TERM 959 
gi No: 4510360 
Predicted Exons: 

INIT 114707 . . . 
INTR 114181 . . , 
TERM 113850 . . , 
(Ac) cDNA Polynucleotide 

- Pat. Appln. SEQ ID NO 

- Ceres seq_id 1378581 

- Alternative transcription 
ID NO 171 : 

-35,-4,-3,18,20 



1814 
1112 
803 



114705 
114003 
113694 
Sequence 
171 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



start site(s) located in SEQ 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 172 

- Ceres seq_id 1378582 

5 - Location of start within SEQ ID NO 171: at 112 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 172: at 17 aa. 

(C) Nomination and Annotation of Domains within 
10 Predicted Polypeptide ( s ) 

- DnaJ domain 

- Location within SEQ ID NO 172: from 57 to 101 aa. 

(Dp) Related Amino Acid Sequences 
15 - Alignment No. Ill 

- gi No. 4263775 

- Description: 

- % Identity: 100 

- Alignment Length: 112 

20 - Location of Alignment in SEQ ID NO 172: from 1 to 

112 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 173 
25 - Ceres seq_id 1378583 

- Location of start within SEQ ID NO 171: at 256 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
30 - DnaJ domain 

- Location within SEQ ID NO 173: from 9 to 53 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 112 
35 - gi No. 4263775 

- Description: 

- % Identity: 100 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 173: from 1 to 

40 64 

Maximum Length Sequence: 

related to: 
Clone IDs: 
45 13599 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 174 

- Ceres seq_id 1383462 

- Alternative transcription start site(s) located in SEQ 
50 ID NO 174: 

-2,2,3,4,5,6,8,14, 18, 24 



WO 00/40695 



PCT/US00/00466 



165 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 175 

- Ceres seq__id 1383463 

5 - Location of start within SEQ ID NO 174: at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

10 - Location within SEQ ID NO 175: from 46 to 119 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 113 

- gi No. 3128176 
15 - Description: 

- % Identity: 37 

- Alignment Length: 194 

- Location of Alignment in SEQ ID NO 175: from 23 

to 197 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 17 6 

- Ceres seq_id 1383464 

- Location of start within SEQ ID NO 174: at 42 nt . 
25 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 176: at 24 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
30 - Plant lipid transfer protein family 

- Location within SEQ ID NO 176: from 33 to 106 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 114 
35 - gi No. 3128176 

- Description: 

- % Identity: 37 

- Alignment Length: 194 

- Location of Alignment in SEQ ID NO 17 6: from 10 

40 to 184 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 177 

- Ceres seq_id 1383465 

45 - Location of start within SEQ ID NO 174: at 90 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

50 - Location within SEQ ID NO 177: from 17 to 90 aa. 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 115 

- gi No. 3128176 

- Description: 

5 - % Identity: 37 

- Alignment Length: 194 

- Location of Alignment in SEQ ID NO 177: from 1 to 

168 



10 Maximum Length Sequence: 
related to: 
Clone IDs: 

156375 

(Ac) cDNA Polynucleotide Sequence 
15 - Pat. Appln. SEQ ID NO 178 

- Ceres seq_id 1386215 

- Alternative transcription start site(s) located in SEQ 
ID NO 178 : 

-38, 12, 17, 18, 19,20,26 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 179 

- Ceres seq_id 1386216 

- Location of start within SEQ ID NO 178: at 2 nt . 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Photosystem I psaG / psaK 

- Location within SEQ ID NO 179: from 77 to 138 aa . 

30 

(Dp) Related Amino Acid Sequences 

- Alignment No. 116 

- gi No. 3885511 

- Description: 

35 - % Identity: 79.3 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 179: from 28 

to 138 



40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 180 

- Ceres seq_id 1386217 

- Location of start within SEQ ID NO 178: at 83 nt . 



45 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Photosystem I psaG / psaK 

- Location within SEQ ID NO 180: from 50 to 111 aa . 



50 



(Dp) Related Amino Acid Sequences 
- Alignment No. 117 
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- gi No. 3885511 

- Description: 

- % Identity: 79.3 

- Alignment Length: 112 

5 - Location of Alignment in SEQ ID NO 180: from 1 to 

111 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 181 
10 - Ceres seq_id 1386218 

- Location of start within SEQ ID NO 178: at 95 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 - Photosystem I psaG / psaK 

- Location within SEQ ID NO 181: from 46 to 107 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 118 
20 - gi No. 3885511 

- Description: 

- % Identity: 79.3 

- Alignment Length: 112 

- Location of Alignment in SEQ ID NO 181: from 1 to 

25 107 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 182 

- Ceres seq_id 2025156 

30 - Location of start within SEQ ID NO 178: at 502 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

35 

Maximum Length Sequence: 

related to: 
Clone IDs: 
21233 

40 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 183 

- Ceres seq_id 1388499 

- Alternative transcription start site(s) located in SEQ 
ID NO 183: 

45 2,5, 6, 10, 16, 30 ,89, 34 6, 34 9 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 184 

- Ceres seq_id 1388500 

50 - Location of start within SEQ ID NO 183: at 62 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No* 119 

- gi No.. 2829899 

- Description: 

- % Identity: 49.3 

- Alignment Length: 150 

- Location of Alignment in SEQ ID NO 184: from 2 to 

150 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 185 

- Ceres seq_id 1388501 

- Location of start within SEQ ID NO 183: at 122 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 120 

- gi No. 2829899 

- Description: 

- % Identity: 49.3 

- Alignment Length: 150 

- Location of Alignment in SEQ ID NO 185: from 1 to 

130 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 186 

- Ceres seq_id 1388502 

- Location of start within SEQ ID NO 183: at 266 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 121 

- gi No. 2829899 

- Description: 

- % Identity: 49.3 

- Alignment Length: 150 

- Location of Alignment in SEQ ID NO 186: from 1 to 

82 

(Ba) Polypeptide Activities: Similar to major latex protein 
activities . 



Maximum Length Sequence: 

related to: 
Clone IDs: 
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21304 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 187 

- Ceres seq_id 1388519 

5 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 188 

- Ceres seq_id 1388520 

10 - Location of start within SEQ ID NO 187: at 1 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 188 : at 38 aa . 

(C) Nomination and Annotation of Domains within 
15 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 122 

- gi No. 4584110 

- Description: 

20 - % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 188: from 20 

to 182 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 189 

- Ceres seq_id 1388521 

- Location of start within SEQ ID NO 187: at 58 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
30 ID NO 189: at 19 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 123 

- gi No. 4584110 

- Description: 

- % Identity: 48.5 

- Alignment Length: 163 

40 - Location of Alignment in SEQ ID NO 189: from 1 to 

163 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 190 
45 - Ceres seq_id 1388522 

- Location of start within SEQ ID NO 187: at 73 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 190: at 14 aa . 



50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 124 

- gi No. 4584110 

- Description: 

5 - % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 190: from 1 to 

158 

10 (Ba) Polypeptide Activities: Similar to pollen specific 
protein activities. 

Maximum Length Sequence: 
related to: 
15 Clone IDs: 
2153 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 191 

- Ceres seq_id 1388563 

20 - Alternative transcription start site(s) located in SEQ 

ID NO 191: 

-3, -2,-1, 13, 15, 146 

(B) Polypeptide Sequence 
25 - Pat. Appln. SEQ ID NO 192 

- Ceres seq_id 1388564 

- Location of start within SEQ ID NO 191: at 2 nt . 

(C) Nomination and Annotation of Domains within 
30 Predicted Polypept ide ( s ) 

(Dp) Related Amino Acid Sequences 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 193 
35 - Ceres seq_id 1388565 

- Location of start within SEQ ID NO 191: at 306 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 (Dp) Related Amino Acid Sequences 

- Alignment No. 125 

- gi No. 3927834 

- Description: 

- % Identity: 31.5 

45 - Alignment Length: 108 

- Location of Alignment in SEQ ID NO 193: from 1 to 

43 

(B) Polypeptide Sequence 
50 - Pat. Appln. SEQ ID NO 194 

- Ceres seq id 1388566 
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- Location of start within SEQ ID NO 191: at 455 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 194 : at 36 aa . 

(Ba) Polypeptide Activities: Similar to hydroxproline-rich 
glycoprotein activities . 



Maximum Length Sequence: 

related to: 
Clone IDs: 

22488 
Public Genomic DNA: 
gi No: 5708384 
Predicted Exons: 
INIT 82228 . . 
TERM 81730 
gi No: 5732090 
Predicted Exons : 
INIT 21301 . . 
TERM 20 8 03 
gi No: 5870169 
Predicted Exons: 
INIT 89258 . . 
TERM 89756 . . 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 195 

- Ceres seq_id 1388793 

- Alternative transcription start site (s 
ID NO 195: 



82063 
81381 



21136 
20454 



89423 
90105 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



located in SEQ 



7, 14, 15, 20, 24, 30, 33,42, 44, 58, 63, 64, 66, 76, 94, 97, 98, 99, 101, 102 
103,104,105,106,108,109,110,115,116,117 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 196 

- Ceres seq_id 1388794 

- Location of start within SEQ ID NO 195: at 154 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 196: at 19 aa . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 12 6 

- gi No. 4584110 

- Description: 

- % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 196: from 1 to 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 197 

- Ceres seq_id 1388795 

5 - Location of start within SEQ ID NO 195: at 169 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 197 : at 14 aa . 

(C) Nomination and Annotation of Domains within 
10 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 127 

- gi No. 4584110 

- Description: 

15 - % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 197: from 1 to 

158 

20 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 198 

- Ceres seq_id 1388796 

- Location of start within SEQ ID NO 195: at 205 nt . 

25 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 128 

- gi No. 4584110 
30 - Description: 

- % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 198: from 1 to 

146 

35 

(Ba) Polypeptide Activities: Similar to pollen specific 
protein activities . 

Maximum Length Sequence: 
40 related to: 
Clone IDs: 

26569 
Public Genomic DNA: 
gi No: 6143856 
45 Predicted Exons : 

INTR 78628 ... 78255 OCKHAMG-CDNA 
INTR 7 8160 ... 78004 OCKHAMG-CDNA 



INTR 78624 ... 78255 OCKHAMG-CDNA 
50 INTR 78160 ... 78004 OCKHAMG-CDNA 
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INTR 
INTR 



78624 
78160 



173 
78255 
78006 



OCKHAMG-CDNA 
OCKHAMG-CDNA 



INTR 78624 
gi No: 6223633 
Predicted Exons 

INTR 78629 



INTR 

INTR 
INTR 

INTR 
INTR 

INTR 



78161 

78625 
78161 

78625 
78161 

78625 



7 8283 OCKHAMG-CDNA 



7 8 2 5 6 OC KHAMG - C DNA 

7 8 0 0 5 OCKHAMG-CDNA 

7 8 2 5 6 OCKHAMG-CDNA 

7 8 0 05 OCKHAMG-CDNA 

7 8 2 5 6 OC KHAMG - C DNA 

7 8 0 0 7 OCKHAMG-CDNA 

7 8284 OCKHAMG-CDNA 



OCKHAMG-CDS 
OCKHAMG-CDS 



INIT 78587 . . . 78256 
TERM 78161 ... 78152 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 199 

- Ceres seq_id 1392041 

- Alternative transcription start site(s) located in SEQ 
ID NO 199: 

3, 4, 5, 6, 7, 12 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 200 

- Ceres seq_id 1392042 

- Location of start within SEQ ID NO 199: at 45 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 200: at 19 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 200: from 23 to 112 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 12 9 

- gi No. 2497753 

- Description: 

- % Identity: 47 

- Alignment Length: 117 

- Location of Alignment in SEQ ID NO 200: 



from 4 to 



113 



Maximum Length Sequence: 

related to: 
Clone IDs: 
28475 
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Public Genomic DNA: 



gi No: 5731404 





Predicted 


Exons : 








INTR 


70440 . . . 


70346 


OCKHAMG-CDNA 


■J) 


TKTTR 


7 n n 7 f 


Q R f, S 






TKTTR 


SO R 

U J J u u . • • 


U J 4 J4 


r\ C K H A MP - P HKT A 




TKTTR 

_L IN _L I\ 


SR 

U J J JO . ■ ■ 


f Q 9 R 4 

U J4. O 4 


nrRHAMn-rnMA 

Uv> r\llriLL v 1^3 U IN xn. 




TKTTR 

JL IN ± JA 


\J_7_L_?nJ ... 


vj _? _L _L -J 


npRH A NIP — PDKT A 




TKTTR 

_L LN ± L\ 


O U >_> O ... 


UO J ^ 


n,CKU AMP — PHKT A 


1 n 


TKTTR 
_L IN ± r\ 


OOO'UO ... 


6 R f Q R 
o o o o 


nrRHAMP-rnMA 

v-* r\ il rll v l kj LJlNrl 




TKTTR 

_L IN J. 1\ 


6 R S R 1 


\J O O L ± *-± 


PPKH AMP — PHKT A 




INIT 


70056 . . . 


69865 


OC KHAMG -CDS 




INTR 


69508 . . . 


69434 


OCKHAMG-CDS 


15 


INTR 


69358 . . . 


69284 


OCKHAMG-CDS 




INTR 


69195 . . . 


69115 


OCKHAMG-CDS 




INTR 


69033 . . . 


68959 


OCKHAMG-CDS 




INTR 


68868 . . . 


68695 


OCKHAMG-CDS 




TERM 


68581 . . 


68546 


OCKHAMG-CDS 


20 


(Ac) cDNA Polynucleotide 


Sequence 





- Pat. Appln. SEQ ID NO 201 

- Ceres seq_id 1393342 

- Alternative transcription start site(s) located in SEQ 
ID NO 201: 

25 16,48 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 202 

- Ceres seq_id 1393343 

30 - Location of start within SEQ ID NO 201: at 118 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 130 

- gi No. 3386621 

- Description: 

- % Identity: 83 

- Alignment Length: 224 

40 - Location of Alignment in SEQ ID NO 202: from 2 to 

225 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

4 5 

Maximum Length Sequence: 

related to: 
Clone IDs: 
29453 

50 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 203 
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- Ceres seq_id 1393556 

- Alternative transcription start site(s) located in SEQ 
ID NO 203: 

7, 11, 14,23, 37, 41, 42, 57 , 61 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 204 

- Ceres seq_id 1393557 

- Location of start within SEQ ID NO 203: at 3 nt. 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 131 
15 - gi No. 1082054 

- Description: 

- % Identity: 36.1 

- Alignment Length: 61 

- Location of Alignment in SEQ ID NO 204: from 23 

20 to 82 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 205 

- Ceres seq_id 1393558 

25 - Location of start within SEQ ID NO 203: at 69 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 205: at 41 aa. 

(C) Nomination and Annotation of Domains within 
30 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 132 

- gi No. 1082054 

- Description: 

35 - % Identity: 36.1 

- Alignment Length: 61 

- Location of Alignment in SEQ ID NO 205: from 1 to 

60 

40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 206 

- Ceres seq_id 1393559 

- Location of start within SEQ ID NO 203: at 72 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
45 ID NO 206: at 40 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
50 - Alignment No. 133 

- gi No. 1082054 
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- Description: 

- % Identity: 36.1 

- Alignment Length: 61 

- Location of Alignment in SEQ ID NO 206: from 1 to 



(Ba) Polypeptide Activities: Similar to transmembrane copper 
transporter protein activities. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
42384 

15 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 207 

- Ceres seq_id 1396782 

- Alternative transcription start site(s) located in SEQ 
ID NO 207 : 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 208 

- Ceres seq_id 1396783 

25 - Location of start within SEQ ID NO 207: at 3 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
30 - Alignment No. 134 

- gi No. 4512613 

- Description: 

- % Identity: 99.2 

- Alignment Length: 122 

35 - Location of Alignment in SEQ ID NO 208: from 11 

to 132 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 209 
40 - Ceres seq_id 1396784 

- Location of start within SEQ ID NO 207: at 33 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 209: at 19 aa. 



45 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 135 

- gi No. 4512613 
50 - Description: 

- % Identity: 99.2 
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- Alignment Length: 122 

- Location of Alignment in SEQ ID NO 209: from 1 to 

122 



5 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 210 

- Ceres seq_id 1396785 

- Location of start within SEQ ID NO 207: at 45 nt . 



10 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 136 

- gi No. 4512613 
15 - Description: 

- % Identity: 99.2 

- Alignment Length: 122 

- Location of Alignment in SEQ ID NO 210: from 1 to 

118 

20 

(Ba) Polypeptide Activities: Similar to intergenic region of 
cyt2-MDHl of yeast activities. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
42402 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 211 

- Ceres seq_id 1396802 

- Alternative transcription start site(s) located in SEQ 
ID NO 211 : 

-145,-56 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 212 

- Ceres seq_id 1396803 

- Location of start within SEQ ID NO 211: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 212: at 19 aa . 



25 



30 



35 



40 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
45 - Alignment No. 137 

- gi No. 2463339 

- Description: 

- % Identity: 38.6 

- Alignment Length: 70 

50 - Location of Alignment in SEQ ID NO 212: from 136 

to 205 
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178 



;B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 213 

- Ceres seq_id 1396804 

- Location of start within SEQ 



ID NO 211: at 136 nt , 



(C) 

Predicted 
(Dp) 



10 



15 



to 160 



Nomination and Annotation of Domains within 
Polypeptide (s) 
Related Amino Acid Sequences 

- Alignment No. 138 

- gi No. 2463339 

- Description: 

- % Identity: 38.6 

- Alignment Length: 70 

- Location of Alignment in SEQ ID NO 213: from 91 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 214 
20 - Ceres seq_id 1396805 

- Location of start within SEQ ID NO 211: at 172 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
25 (Dp) Related Amino Acid Sequences 

- Alignment No. 139 

- gi No. 2463339 

- Description: 

- % Identity: 38.6 

30 - Alignment Length: 70 

- Location of Alignment in SEQ ID NO 214: from 79 

to 148 



(Ba) Polypeptide Activities: Similar to rbcX protein 
35 activities. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
40 5105 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 215 

- Ceres seq_id 1397130 

- Alternative transcription start site(s) located in SEQ 
45 ID NO 215: 

-1 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 216 
50 - Ceres seq_id 1397131 

- Location of start within SEQ ID NO 215: at 1 nt . 
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179 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
5 - Alignment No. 140 

- gi No. 2827551 

- Description: 

- % Identity: 63.1 

- Alignment Length: 17 9 

10 - Location of Alignment in SEQ ID NO 216: from 2 to 

176 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 217 
15 - Ceres seq_id 1397132 

- Location of start within SEQ ID NO 215: at 52 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
20 (Dp) Related Amino Acid Sequences 

- Alignment No. 141 

- gi No. 2827551 

- Description: 

- % Identity: 63.1 

25 - Alignment Length: 179 

- Location of Alignment in SEQ ID NO 217: from 1 to 

159 



(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 218 

- Ceres seq_id 1397133 

- Location of start within SEQ ID NO 215: at 211 nt . 



(C) Nomination and Annotation of Domains within 
35 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 142 

- gi No. 2827551 

- Description: 

40 - % Identity: 63.1 

- Alignment Length: 17 9 

- Location of Alignment in SEQ ID NO 218: from 1 to 

106 



45 (Ba) Polypeptide Activities: Similar to meth CpG binding 
protein activities . 



50 



Maximum Length Sequence: 

related to: 
Clone IDs: 
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92204 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 219 

- Ceres seq_id 1398004 

5 - Alternative transcription start site(s) located in SEQ 

ID NO 219: 

2/3,5/8/28 



(B) Polypeptide Sequence 
10 - Pat. Appln. SEQ ID NO 220 

- Ceres seq__id 1398005 

- Location of start within SEQ ID NO 219: at 68 nt 

(C) Nomination and Annotation of Domains within 
15 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 143 

- gi No. 4454037 

- Description: 

20 - % Identity: 99.3 

- Alignment Length: 151 

- Location of Alignment in SEQ ID NO 220: from 1 to 151 



(Ba) Polypeptide Activities: Similar to major latex protein 
25 activities. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
30 98584 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 221 

- Ceres seq_id 1399370 



35 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 222 

- Ceres seq_id 1399371 

- Location of start within SEQ ID NO 221: at 3 nt . 

40 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 144 
45 - gi No. 4886285 

- Description: 

- % Identity: 28.8 

- Alignment Length: 59 

- Location of Alignment in SEQ ID NO 222: from 25 

50 to 83 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 223 

- Ceres seq_id 1399372 

- Location of start within SEQ ID NO 221: at 45 nt . 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 145 
10 - gi No. 4886285 

- Description: 

- % Identity: 28.8 

- Alignment Length: 5 9 

- Location of Alignment in SEQ ID NO 223: from 11 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 224 

- Ceres seq_id 1399373 

20 - Location of start within SEQ ID NO 221: at 60 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 



(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 14 C 

- gi No. 4886285 

- Description: 

30 - % Identity: 28.8 

- Alignment Length: 59 

- Location of Alignment in SEQ ID NO 224: from 6 to 

64 

35 (Ba) Polypeptide Activities: Similar to outer envelope 
membrane protein in choloroplast in pea activities. 



Maximum Length Sequence: 
40 related to: 

Clone IDs: 

107400 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 225 
45 - Ceres seq_id 1425147 

- Alternative transcription start site(s) located in SEQ 
ID NO 225: 

20, 31, 49, 58, 80 

50 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 226 
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- Ceres seq_id 1425148 

- Location of start within SEQ ID NO 225: at 85 nt . 

(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 147 

- gi No. 3510256 

- Description : 

10 - % Identity: 37.8 

- Alignment Length: 158 

- Location of Alignment in SEQ ID NO 226: from 1 to 

155 

15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 227 

- Ceres seq_id 1425149 

- Location of start within SEQ ID NO 225: at 241 nt . 

20 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 148 

- gi No. 3510256 
25 - Description: 

- % Identity: 37.8 

- Alignment I-ength: 158 

- Location of Alignment in SEQ ID NO 227: from 1 to 

103 



30 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 



Maximum Length Sequence: 
35 related to: 

Clone IDs: 
11073 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 22 8 
40 - Ceres seq_id 1441102 

- Alternative transcription start site(s) located in SEQ 
ID NO 228 : 

2, 3,4, 5, 8, 10, 16, 22, 51, 69, 388 

45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 229 

- Ceres seq_id 1441103 

- Location of start within SEQ ID NO 228: at 1 nt . 



50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 14 9 

- gi No. 4337175 

- Description: 

5 - % Identity: 100 

- Alignment Length: 179 

- Location of Alignment in SEQ ID NO 229: from 19 



10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 230 

- Ceres seq_id 1441104 

- Location of start within SEQ ID NO 228: at 55 nt . 



15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 150 

- gi No. 4337175 
20 - Description: 

- % Identity: 100 

- Alignment Length: 179 

- Location of Alignment in SEQ ID NO 230: from 1 to 

179 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 231 

- Ceres seq_id 1441105 

- Location of start within SEQ ID NO 228: at 94 nt . 

30 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 151 
35 - gi No. 4337175 

- Description: 

- % Identity: 100 

- Alignment Length: 17 9 

- Location of Alignment in SEQ ID NO 231: from 1 to 

40 166 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 



45 Maximum Length Sequence: 
related to: 
Clone IDs: 

267227 

(Ac) cDNA Polynucleotide Sequence 
50 - Pat. Appln. SEQ ID NO 232 

- Ceres seq_id 1447480 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 233 
5 - Ceres seq_id 1447481 

- Location of start within SEQ ID NO 232: at 52 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
10 (Dp) Related Amino Acid Sequences 

- Alignment No. 152 

- gi No. 3510256 

- Description: 

- % Identity: 54 

15 - Alignment Length: 50 

- Location of Alignment in SEQ ID NO 233: from 77 

to 126 



(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 234 

- Ceres seq_id 1447482 

- Location of start within SEQ ID NO 232: at 208 nt . 



(C) Nomination and Annotation of Domains within 
25 Predicted Polypept ide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 153 

- gi No. 3510256 

- Description: 

30 - % Identity: 54 

- Alignment Length: 50 

- Location of Alignment in SEQ ID NO 234: from 25 



35 (Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 



Maximum Length Sequence: 
related to: 
40 Clone IDs: 

268712 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 235 

- Ceres seq_id 1447577 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 236 

- Ceres seq_id 1447578 

50 - Location of start within SEQ ID NO 235: at 2 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 154 
5 - gi No. 5080769 

- Description: 

- % Identity: 100 

- Alignment Length: 17 

- Location of Alignment in SEQ ID NO 236: from 16 

10 to 32 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 237 

- Ceres seq_id 1447579 

15 - Location of start within SEQ ID NO 235: at 165 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
20 - Alignment No. 155 

- gi No. 5080769 

- Description: 

- % Identity: 91.5 

- Alignment Length: 153 

25 - Location of Alignment in SEQ ID NO 237: from 1 to 

115 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 238 
30 - Ceres seq_id 1447580 

- Location of start within SEQ ID NO 235: at 298 nt 

(Ba) Polypeptide Activities: Similar to 
hydroxyproline-rich glycoprotein activities . 

35 



Maximum Length Sequence: 

related to: 
Clone IDs: 
40 94821 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 239 

- Ceres seq_id 1447922 

- Alternative transcription start site(s) located in SEQ 
45 ID NO 239: 

12, 16 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 240 
50 - Ceres seq_id 1447923 

- Location of start within SEQ ID NO 239: at 81 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
5 - Alignment No. 156 

- gi No. 3510256 

- Description: 

- % Identity: 37.8 

- Alignment Length: 158 

10 - Location of Alignment in SEQ ID NO 240: from 1 to 

155 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 241 
15 - Ceres seq_id 1447924 

- Location of start within SEQ ID NO 239: at 237 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
20 (Dp) Related Amino Acid Sequences 

- Alignment No. 157 

- gi No. 3510256 

- Description: 

- % Identity: 37.8 

25 - Alignment Length: 158 

- Location of Alignment in SEQ ID NO 241: from 1 to 

103 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
30 dicot specific gene, plant specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
35 20539 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 242 

- Ceres seq_id 1448012 

- Alternative transcription start site(s) located in SEQ 
40 ID NO 242 : 

-30, -6 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 243 
45 - Ceres seq_id 1448013 

- Location of start within SEQ ID NO 242: at 79 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
50 - Ribosomal protein S16 

- Location within SEQ ID NO 243: from 9 to 70 aa. 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 158 

- gi No. 3096931 
5 - Description: 

- % Identity: 57.5 

- Alignment Length: 113 

- Location of Alignment in SEQ ID NO 243: from 1 to 

112 

10 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 244 

- Ceres seq_id 1448014 

- Location of start within SEQ ID NO 242: at 139 nt . 

15 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S16 

- Location within SEQ ID NO 244: from 1 to 50 aa. 

20 

(Dp) Related Amino Acid Sequences 

- Alignment No. 159 

- gi No. 3096931 

- Description: 

25 - % Identity: 57.5 

- Alignment Length: 113 

- Location of Alignment in SEQ ID NO 244: from 1 to 

92 

30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 245 

- Ceres seq_id 1448015 

- Location of start within SEQ ID NO 242: at 232 nt . 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 160 

- gi No. 3096931 
40 - Description: 

- % Identity: 57.5 

- Alignment Length: 113 

- Location of Alignment in SEQ ID NO 245: from 1 to 

61 

45 

Maximum Length Sequence: 

related to: 
Clone IDs: 
34091 

50 (Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 246 
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- Ceres seq_id 1448135 

- Alternative transcription start site(s) located in SEQ 
ID NO 246: 

-1 

5 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 247 

- Ceres seq_id 1448136 

- Location of start within SEQ ID NO 246: at 3 nt . 

10 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 



15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 248 

- Ceres seq_id 1448137 

- Location of start within SEQ ID NO 246: at 68 nt . 



20 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Ribosomal protein S14p/S29e 

- Location within SEQ ID NO 248: from 3 to 54 aa. 



25 (Dp) Related Amino Acid Sequences 

- Alignment No. 161 

- gi No. 4506717 

- Description: 

- % Identity: 72.2 

30 - Alignment Length: 54 

- Location of Alignment in SEQ ID NO 248: from 1 to 

54 



(B) Polypeptide Sequence 
35 - Pat. Appln. SEQ ID NO 249 

- Ceres seq_id 1448138 

- Location of start within SEQ ID NO 246: at 85 nt . 

(C) Nomination and Annotation of Domains within 
40 Predicted Polypept ide ( s ) 

(Dp) Related Amino Acid Sequences 



Maximum Length Sequence: 
related to: 
45 Clone IDs: 
39285 
Public Genomic DNA: 
gi No: 4406776 
Predicted Exons: 
50 SINGLE 61584 ... 61895 GENBANK 
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SINGLE 61584 ... 61895 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 250 

- Ceres seq_id 1448185 

5 - Alternative transcription start site(s) located in SEQ 

ID NO 250: 

-36, -18, -17, 16, 19 

(B) Polypeptide Sequence 
10 - Pat. Appln. SEQ ID NO 251 

- Ceres seq_id 1448186 

- Location of start within SEQ ID NO 250: at 41 nt . 



(C) Nomination and Annotation of Domains within 
15 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 162 

- gi No. 4406787 

- Description: 

20 - % Identity: 100 

- Alignment Length: 103 

- Location of Alignment in SEQ ID NO 251: from 1 to 

103 



25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 252 

- Ceres seq_id 1448187 

- Location of start within SEQ ID NO 250: at 68 nt . 



30 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 163 

- gi No. 4406787 
35 - Description: 

- % Identity: 100 

- Alignment Length: 103 

- Location of Alignment in SEQ ID NO 252: from 1 to 

94 

40 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 253 

- Ceres seq_id 1448188 

- Location of start within SEQ ID NO 250: at 89 nt . 

45 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 164 
50 - gi No. 4406787 

- Description: 
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- % Identity: 100 

- Alignment Length: 103 

- Location of Alignment in SEQ ID NO 253: from 1 to 87 

5 (Ba) Polypeptide Activities: Similar to NADH dehydrogenase 
protein activities 



10 Maximum Length Sequence: 
related to: 
Clone IDs: 
99119 

(Ac) cDNA Polynucleotide Sequence 
15 - Pat. Appln. SEQ ID NO 254 

- Ceres seq_id 1450875 



(B) Polypeptide Sequence 
20 - Pat. Appln, SEQ ID NO 255 

- Ceres seq_id 1450876 

- Location of start within SEQ ID NO 254: at 2 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 255: at 26 aa. 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- ATPases associated with various cellular 
activities (AAA) 

30 - Location within SEQ ID NO 255: from 391 to 598 

aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 165 
35 - gi No. 4309734 

- Description: 

- % Identity: 86 

- Alignment Length: 633 

- Location of Alignment in SEQ ID NO 255: from 9 to 

40 634 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 256 

- Ceres seq_id 1450877 

45 - Location of start within SEQ ID NO 254: at 14 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 256: at 22 aa . 



(C) Nomination and Annotation of Domains within 
50 Predicted Polypeptide ( s ) 
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- ATPases associated with various cellular 
activities (AAA) 

- Location within SEQ ID NO 256: from 387 to 594 

aa . 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 166 

- gi No. 4309734 

- Description: 

10 - % Identity: 86 

- Alignment Length: 633 

- Location of Alignment in SEQ ID NO 256: from 5 to 

63 0 



15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 257 

- Ceres seq_id 1450878 

- Location of start within SEQ ID NO 254: at 68 nt . 



20 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- ATPases associated with various cellular 
activities (AAA) 

- Location within SEQ ID NO 257: from 369 to 576 

25 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 167 

- gi No. 4309734 
30 - Description: 

- % Identity: 86 

- Alignment Length: 633 

- Location of Alignment in SEQ ID NO 257: from 1 to 

612 

35 

Maximum Length Sequence: 

related to: 
Clone IDs: 

269321 

40 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 258 

- Ceres seq_id 1459191 



45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 259 

- Ceres seq_id 1459192 

- Location of start within SEQ ID NO 258: at 1 nt . 



50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 168 

- gi No. 4490728 

- Description: 

- % Identity: 100 

- Alignment Length: 17 

- Location of Alignment in SEQ ID NO 259: from 21 

to 37 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 260 

- Ceres seq_id 1459193 

- Location of start within SEQ ID NO 258: at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 169 

- gi No. 4490728 

- Description: 

- % Identity: 85.9 

- Alignment Length: 85 

- Location of Alignment in SEQ ID NO 260: from 37 

to 110 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
8446 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 261 

- Ceres seq__id 1461848 

- Alternative transcription start site(s) located in SEQ 
ID NO 261: 

-3,6,7,11,31,32,34,62,63,69,70,71,97 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 2 62 

- Ceres seq__id 1461849 

- Location of start within SEQ ID NO 261: at 113 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 262: at 36 aa . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- MAPEG family 

- Location within SEQ ID NO 262: from 59 to 136 aa . 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 17 0 

- gi No. 4758714 

- Description: 

- % Identity: 41.9 

- Alignment Length: 131 

- Location of Alignment in SEQ ID NO 262: from 8 to 



136 



10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 263 

- Ceres seq_id 1461850 

- Location of start within SEQ ID NO 261: at 197 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide ( s ) 

- MAPEG family 

- Location within SEQ ID NO 263: from 31 to 108 aa . 

20 (Dp) Related Amino Acid Sequences 

- Alignment No. 171 

- gi No. 4758714 

- Description: 

- % Identity: 41.9 

25 - Alignment Length: 131 

- Location of Alignment in SEQ ID NO 263: from 1 to 

108 

(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 264 

- Ceres seq_id 1461851 

- Location of start within SEQ ID NO 261: at 329 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 264 : at 22 aa . 

35 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- MAPEG family 

- Location within SEQ ID NO 264: from 1 to 64 aa . 

40 

(Dp) Related Amino Acid Sequences 

- Alignment No. 172 

- gi No. 4758714 

- Description: 

45 - % Identity: 41.9 

- Alignment Length: 131 

- Location of Alignment in SEQ ID NO 264: from 1 to 

64 



50 Maximum Length Sequence: 
related to: 



WO 00/40695 



PCTYUS00/00466 



194 

Clone IDs: 
25093 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 265 

- Ceres seq_id 1472772 

- Alternative transcription start site(s) located in SEQ 
ID NO 265: 

-1,3 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 266 

- Ceres seq_id 1472773 

- Location of start within SEQ ID NO 265: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 266: at 32 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 266: from 45 to 108 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 17 3 

- gi No. 3062791 

- Description: 

- % Identity: 72.2 

- Alignment Length: 90 

- Location of Alignment in SEQ ID NO 266: from 21 

to 110 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 267 

- Ceres seq_id 1472774 

- Location of start within SEQ ID NO 265: at 27 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 267 : at 24 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 267: from 37 to 100 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 174 

- gi No. 3062791 

- Description: 

- % Identity: 7 2.2 

- Alignment Length: 90 

- Location of Alignment in SEQ ID NO 267: from 13 

to 102 
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Maximum Length Sequence: 

related to: 
Clone IDs: 
42300 

5 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 268 

- Ceres seq_id 1533352 

- Alternative transcription start site(s) located in SEQ 
ID NO 268 : 

10 17 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 269 

- Ceres seq_id 1533353 

15 - Location of start within SEQ ID NO 268: at 1 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- haloacid dehalogenase-like hydrolase 
20 - Location within SEQ ID NO 269: from 82 to 266 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 175 

- gi No. 3913203 
25 - Description: 

- % Identity: 32.8 

- Alignment Length: 196 

- Location of Alignment in SEQ ID NO 269: from 81 

to 266 

30 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 270 

- Ceres seq_id 1533354 

- Location of start within SEQ ID NO 268: at 25 nt . 

35 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- haloacid dehalogenase-like hydrolase 

- Location within SEQ ID NO 270: from 74 to 258 aa . 

40 

(Dp) Related Amino Acid Sequences 

- Alignment No. 17 6 

- gi No. 3913203 

- Description: 

45 - % Identity: 32.8 

- Alignment Length: 196 

- Location of Alignment in SEQ ID NO 270: from 73 

to 258 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 271 
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- Ceres seq_id 1533355 

- Location of start within SEQ ID NO 268: at 265 nt . 

(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

- haloacid dehalogenase-like hydrolase 

- Location within SEQ ID NO 271: from 1 to 178 aa . 

(Dp) Related Amino Acid Sequences 
10 - Alignment No. 177 

- gi No. 3913203 

- Description: 

- % Identity: 32.8 

- Alignment Length: 196 

15 - Location of Alignment in SEQ ID NO 271: from 1 to 

178 

Maximum Length Sequence: 
related to: 
20 Clone IDs: 

158412 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 272 

- Ceres seq_id 1534544 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 273 

- Ceres seq_id 1534545 

30 - Location of start within SEQ ID NO 272: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 178 

- gi No. 2317676 

- Description: 

- % Identity: 37.2 

- Alignment Length: 115 

40 - Location of Alignment in SEQ ID NO 273: from 22 

to 134 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 274 
45 - Ceres seq_id 1534546 

- Location of start within SEQ ID NO 272: at 524 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
50 (Dp) Related Amino Acid Sequences 

- Alignment No. 17 9 



WO 00/40695 



PCT/USOO/00466 



197 

gi No. 4836939 
Description : 
% Identity: 47.3 
Alignment Length: 150 

Location of Alignment in SEQ ID NO 274: from 19 



to 166 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 275 
10 - Ceres seq_id 1534547 

- Location of start within SEQ ID NO 272: at 815 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 (Dp) Related Amino Acid Sequences 

- Alignment No. 18 0 

- gi No. 4836939 

- Description: 

- % Identity: 47.3 

20 - Alignment Length: 150 

- Location of Alignment in SEQ ID NO 275: from 1 to 

69 

(Ba) Polypeptide Activities: Similar to calcium independent 
25 phrophospholipase A2 protein activities, and acy-protein 
thioesterase activities. 



Maximum Length Sequence: 
30 related to: 
Clone IDs: 

108109 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ. ID NO 276 
35 - Ceres seq_id 1567172 

- Alternative transcription start site(s) located in SEQ 
ID NO 27 6: 

22 

40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 277 

- Ceres seq_id 1567173 

- Location of start within SEQ ID NO 276: at 152 nt . 

45 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 181 

- gi No. 1076301 
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- Description: CONSTANS protein - Arabidopsis 
thaliana >gi I 1161514 | emb | CAA64407 | (X94937) CONSTANS protein 
[Arabidopsis thaliana] 

- % Identity: 49.2 

5 - Alignment Length: 63 

- Location of Alignment in SEQ ID NO 277: from 12 

to 74 

(B) Polypeptide Sequence 
10 - Pat. Appln. SEQ ID NO 278 

- Ceres seq_id 1567174 

- Location of start within SEQ ID NO 276: at 155 nt . 

(C) Nomination and Annotation of Domains within 
15 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 182 

- gi No. 1076301 

- Description: CONSTANS protein - Arabidopsis 

20 thaliana >gi | 1161514 | emb | CAA64407 | (X94937) CONSTANS protein 
[Arabidopsis thaliana] 

- % Identity: 49.2 

- Alignment Length: 63 

- Location of Alignment in SEQ ID NO 278: from 11 

25 to 73 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 279 

- Ceres seq_id 1567175 

30 - Location of start within SEQ ID NO 276: at 443 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 183 

- gi No. 1076301 

- Description: CONSTANS protein - Arabidopsis 
thaliana >gi | 1161514 | emb | CAA64407 | (X94937) CONSTANS protein 
[Arabidopsis thaliana ] 

40 - % Identity: 67.4 

- Alignment Length: 4 3 

- Location of Alignment in SEQ ID NO 279: from 261 

to 303 

45 Maximum Length Sequence: 
related to: 
Clone IDs: 
168 

Public Genomic DNA: 
50 gi No: 4895213 

Predicted Exons : 
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INIT 22484 . . . 22368 OCKHAMG-CDS 
TERM 21760 ... 21704 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 280 
5 - Ceres seq__id 1567535 

- Alternative transcription start site(s) located in SEQ 
ID NO 280: 

12,22,23,25,28,36, 68 

10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 281 

- Ceres seq_id 1567536 

- Location of start within SEQ ID NO 280: at 104 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 184 

- gi No. 1351365 

20 - Description: UBIQUINOL-CYTOCHROME C REDUCTASE 

COMPLEX 6.7 KD PROTEIN (CR6) >gi | 2 1 3 0 0 02 | pir | | S 68 9 6 9 
ubiquinol--cytochrome-c reductase (EC 1.10.2.2) - potato 
>gi I 633683 I emb | CAA57768 I (X82325) cytochrome c reductase 
subunit [Solanum tuberosum] 

25 - % Identity: 52.6 

- Alignment Length: 58 

- Location of Alignment in SEQ ID NO 281: from 2 to 

57 

30 Maximum Length Sequence: 
related to: 
Clone IDs: 
21305 

(Ac) cDNA Polynucleotide Sequence 
35 - Pat. Appln. SEQ ID NO 282 

- Ceres seq_id 1569689 

- Alternative transcription start site(s) located in SEQ 
ID NO 282 : 

28,29,49 

40 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 283 

- Ceres seq__id 1569690 

- Location of start within SEQ ID NO 282: at 1 nt . 

45 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 283: at 32 aa. 



50 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 284 

- Ceres seq_id 1569691 

- Location of start within SEQ ID NO 282: at 92 nt. 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 185 
10 - gi No. 2191138 

- Description: (AF007269) A_IG002N0 1 . 18 gene 
product [Arabidopsis thaliana] 

- % Identity: 19.5 

- Alignment Length: 8 7 

15 - Location of Alignment in SEQ ID NO 284: from 3 to 

83 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 285 
20 - Ceres seq_id 1569692 

- Location of start within SEQ ID NO 282: at 248 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
25 (Dp) Related Amino Acid Sequences 

- Alignment No. 18 6 

- gi No. 2191138 

- Description: (AF007269) A_IG002N01 . 18 gene 
product [Arabidopsis thaliana] 

30 - % Identity: 19.5 

- Alignment Length: 87 

- Location of Alignment in SEQ ID NO 285: from 1 to 

31 

35 (Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

Maximum Length Sequence: 
related to: 
40 Clone IDs: 
41988 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 286 

- Ceres seq_id 1571042 

45 - Alternative transcription start site(s) located in SEQ 

ID NO 286: 

-1,2,3,4,7 

(B) Polypeptide Sequence 
50 - Pat. Appln. SEQ ID NO 287 

- Ceres seq id 1571043 
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- Location of start within SEQ ID NO 286: at 63 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
5 (Dp) Related Amino Acid Sequences 

- Alignment No. 187 

- gi No. 2129641 

- Description: major latex protein type 1 - 
Arabidopsis thaliana >gi | 1107493 | emb | CAA63026 | (X91960) major 

10 latex protein typel [Arabidopsis thaliana] 

- % Identity: 71.3 

- Alignment Length: 150 

- Location of Alignment in SEQ ID NO 287: from 1 to 

150 

15 

Maximum Length Sequence: 

related to: 
Clone IDs: 
16143 

20 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 288 

- Ceres seq__id 1571079 

- Alternative transcription start site(s) located in SEQ 
ID NO 288 : 

25 -5, 37, 86, 341, 350 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 289 

- Ceres seq_id 1571080 

30 - Location of start within SEQ ID NO 288: at 51 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Pathogenesis-related protein Bet v I family 

35 - Location within SEQ ID NO 289: from 5 to 155 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 188 

- gi No. 1321731 

40 - Description: (Z72439) major allergen Cor a 1 

[Corylus avellana] 

- % Identity: 35.7 

- Alignment Length: 159 

- Location of Alignment in SEQ ID NO 289: from 5 to 

45 155 

Maximum Length Sequence: 

related to: 
Clone IDs: 
50 42101 

(Ac) cDNA Polynucleotide Sequence 
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- Pat. Appln. SEQ ID NO 290 

- Ceres seq_id 1572097 



5 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 291 

- Ceres seq_id 1572098 

- Location of start within SEQ ID NO 290: at 63 nt . 

10 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Adhesion lipoprotein 

- Location within SEQ ID NO 291: from 118 to 191 

aa . 

15 

(Dp) Related Amino Acid Sequences 

- Alignment Mo. 18 9 

- gi No. 2129641 

- Description: major latex protein type 1 - 

20 Arabidopsis thaliana >gi I 1 1 07 4 93 | emb | CAA6302 6 | (X91960) major 
latex protein typel [Arabidopsis thaliana] 

- % Identity: 69.5 

- Alignment Length: 154 

- Location of Alignment in SEQ ID NO 291: from 1 to 

25 154 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 292 

- Ceres seq_id 1572099 

30 - Location of start within SEQ ID NO 290: at 336 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Adhesion lipoprotein 

35 - Locarion within SEQ ID NO 292: from 27 to 100 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 190 

- gi No. 2129641 

40 - Description: major latex protein type 1 - 

Arabidopsis thaliana >gi I 1107493 | emb I CAA63026 | (X91960) major 
latex protein typel [Arabidopsis thaliana] 

- % Identity: 69.5 

- Alignment Length: 154 

45 - Location of Alignment in SEQ ID NO 292: from 1 to 

63 

Maximum Length Sequence: 
related to: 
50 Clone IDs: 
27643 
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(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 293 

- Ceres seq_id 1572890 

- Alternative transcription start site(s) located in SEQ 
5 ID NO 293: 

-10 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 294 
10 - Ceres seq_id 1572891 

- Location of start within SEQ ID NO 293: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 - Photosystem I psaG / psaK 

- Location within SEQ ID NO 294: from 71 to 151 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 191 
20 - gi No. 3885511 

- Description: (AF084200) similar to PSI-K subunit 
of photosystem I from barley [Medicago sativa] 

- % Identity: 80.3 

- Alignment Length: 128 

25 - Location of Alignment in SEQ ID NO 294: from 22 

to 148 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 295 
30 - Ceres seq_id 1572892 

- Location of start within SEQ ID NO 293: at 64 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
35 - Photosystem I psaG / psaK 

- Location within SEQ ID NO 295: from 50 to 130 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 192 
40 - gi No. 3885511 

- Description: (AF084200) similar to PSI-K subunit 
of photosystem I from barley [Medicago sativa] 

- % Identity: 80.3 

- Alignment Length: 128 

45 - Location of Alignment in SEQ ID NO 295: from 1 to 

127 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 296 
50 - Ceres seq_id 1572893 

- Location of start within SEQ ID NO 293: at 76 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Photosystem I psaG / psaK 

- Location within SEQ ID NO 296: from 46 to 126 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 193 

- gi No. 3885511 

10 - Description: (AF084200) similar to PSI-K subunit 

of photosysrem I from barley [Medicago sativa] 

- % Identity: 80.3 

- Alignment Length: 128 

- Location of Alignmenr in SEQ ID NO 296: from 1 to 



Maximum Length Sequence: 

related to: 
Clone IDs: 
20 33027 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 297 

- Ceres seq_id 1573606 

- Alternative transcription start site(s) located in SEQ 
25 ID NO 297: 

7, 8, 13 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 298 
30 - Ceres seq_id 1573607 

- Location of start within SEQ ID NO 297: at 56 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 298: at 19 aa. 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Pollen proteins Ole e I family 

- Location within SEQ ID NO 298: from 32 to 131 aa. 



40 (Dp) Related Amino Acid Sequences 

- Alignment No. 194 

- gi No. 4584110 

- Description: (AJ133639) SAH7 protein [Arabidopsis 

thaliana] 

45 - % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 298: from 1 to 

163 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 299 
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- Ceres seq_id 1573608 

- Location of start within SEQ ID NO 297: at 71 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 299: at 14 aa. 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Pollen proteins Ole e I family 

- Location within SEQ ID NO 299: from 27 to 126 aa. 

10 

(Dp) Related Amino Acid Sequences 

- Alignment No. 195 

- gi No. 4584110 

- Description: (AJ133639) SAH7 protein [Arabidopsis 

15 thaliana] 

- % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 299: from 1 to 

158 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 300 

- Ceres seq__id 1573609 

- Location of start within SEQ ID NO 297: at 107 nt. 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Pollen proteins Ole e I family 

- Location within SEQ ID NO 300: from 15 to 114 aa. 

30 

(Dp) Related Amino Acid Sequences 

- Alignment No. 196 

- gi No. 4584110 

- Description: (AJ133639) SAH7 protein [Arabidopsis 

35 thaliana] 

- % Identity: 48.5 

- Alignment Length: 163 

- Location of Alignment in SEQ ID NO 300: from 1 to 

146 

40 

Maximum Length Sequence: 

related to: 
Clone IDs: 
31422 

45 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 301 

- Ceres seq__id 1573861 



50 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 302 
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- Ceres seq_id 1573862 

- Location of start within SEQ ID NO 301: at 67 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 197 

- gi No. 2191138 

- Description: (AF007269) A_IG002N0 1 . 1 8 gene 
product [Arabidopsis thaliana] 

- % Identity: 28.1 

- Alignment Length: 160 

- Location of Alignment in SEQ ID NO 302: from 3 to 

156 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 303 

- Ceres seq_id 1573863 

- Location of start within SEQ ID NO 301: at 223 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 198 

- gi No. 2191138 

- Description: (AF007269) A_IG0 02N0 1 . 1 8 gene 
product [Arabidopsis thaliana] 

- % Identity: 28.1 

- Alignment Length: 160 

- Location of Alignment in SEQ ID NO 303: from 1 to 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
40916 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 304 

- Ceres seq_id 1574093 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 305 

- Ceres seq__id 1574094 

- Location of start within SEQ ID NO 304: at 47 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
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- Zinc finger, C3HC4 type (RING finger) 

- Location within SEQ ID NO 305: from 152 to 192 

aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 199 

- gi No. 3790593 

- Description: (AF079185) RING-H2 finger protein 
RHYla [Arabidopsis thaliana] 

- % Identity: 55.1 

- Alignment Length: 4 9 

- Location of Alignment in SEQ ID NO 305: from 148 

to 196 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 306 

- Ceres seq_id 1574095 

- Location of start within SEQ ID NO 304: at 56 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Zinc finger, C3HC4 type (RING finger) 

- Location within SEQ ID NO 306: from 149 to 189 

aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 200 

- gi No. 3790593 

- Description: (AF079185) RING-H2 finger protein 
RHYla [Arabidopsis thaliana] 

- % Identity: 55.1 

- Alignment Length: 4 9 

- Location of Alignment in SEQ ID NO 306: from 145 

to 193 

Maximum Length Sequence: 

related to: 
Clone IDs: 
35447 

Public Genomic DNA: 
gi No: 4263694 
Predicted Exons : 



INIT 


79640 


79806 


GENBANK 




TERM 


79888 . . 


80164 


GENBANK 




INTR 


79508 .. 


79806 


OCKHAMG- 


■CDNA 


INTR 


79888 . . 


. 80309 


OCKHAMG- 


CDNA 



INIT 79640 . . . 79806 
TERM 79888 . . . 80164 
(Ac) cDNA Polynucleotide Sequence 



OCKHAMG-CDS 
OCKHAMG-CDS 
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- Pat. Appln. SEQ ID NO 307 

- Ceres seq__id 1580388 

- Alternative transcription start site(s) located in SEQ 
ID NO 307 : 

2, 3, 4, 5, 6, 7, 63, 94 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 308 

- Ceres seq_id 1580389 

- Location of start within SEQ ID NO 307: at 133 nr. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 3 08 : at 2 7 aa . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Cystatin domain 

- Location within SEQ ID NO 308: from 87 to 141 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 201 

- gi No. 2204077 

- Description: (D85623) extracellular insoluble 
cystatin [Daucus carota] 

- % Identity: 44.9 

- Alignment Length: 136 

- Location of Alignment in SEQ ID NO 308: from 6 to 

141 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 309 

- Ceres seq_id 1580390 

- Location of start: within SEQ ID NO 307: at 142 m. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 309: at 24 aa . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Cystatin domain 

- Location within SEQ ID NO 309: from 84 to 138 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 202 

- gi No. 2204077 

- Description: (D85623) extracellular insoluble 
cystatin [Daucus carota] 

- % Identity: 44.9 

- Alignment Length: 136 

- Location of Alignment in SEQ ID NO 309: from 3 to 

138 



Maximum Length Sequence: 
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related to: 
Clone IDs: 

22677 
Public Genomic DNA: 
5 gi No: 4972043 

Predicted Exons : 

SINGLE 18935 ... 19282 GENBANK 



INTR 18856 ... 19527 OCKHAMG-CDNA 

10 

SINGLE 18935 ... 19282 OCKHAMG-CDS 
gi No: 4972077 
Predicted Exons: 

INTR 60360 ... 61031 OCKHAMG-CDNA 

15 

SINGLE 60439 ... 60786 OCKHAMG-CDS 
gi No: 5748495 
Predicted Exons: 

INTR 60360 . . . 61031 OCKHAMG-CDNA 

20 

SINGLE 60439 ... 60786 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 310 

- Ceres seq_id 1582959 

25 - Alternative transcription start site(s) located in SEQ 

ID NO 310: 

2, 5, 6, 7, 58, 61 



(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 311 

- Ceres seq__id 1582960 

- Location of start within SEQ ID NO 310: at 80 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 311: at 19 aa. 

35 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Dehydrins 

- Location within SEQ ID NO 311: from 37 to 113 aa . 

40 

(Dp) Related Amino Acid Sequences 

- Alignment No. 203 

- gi No. 4972049 

- Description: (AL078470) glycine-rich protein like 
45 [Arabidopsis thaliana] 

- % Identity: 100 

- Alignment Length: 115 

- Location of Alignment in SEQ ID NO 311: from 1 to 

115 

50 

Maximum Length Sequence: 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 204 

- gi No. 133940 
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- Description: 40S RIBOSOMAL PROTEIN S3A (S1A) 

>gi I 7 0851 | pir M R3XL3A ribosomal protein S3a - African clawed 
frog >gi | 65091 | emb | CAA40592 | (X57322) ribosomal protein Sla 
[Xenopus laevis] 
5 - % Identity: 80.1 

- Alignment Length: 231 

- Location of Alignment in SEQ ID NO 313: from 1 to 

228 

10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 314 

- Ceres seq_id 1663223 

- Location of start within SEQ ID NO 312: at 204 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide ( s ) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 314: from 61 to 145 aa . 

20 (Dp) Related Amino Acid Sequences 

- Alignment No. 205 

- gi No. 133940 

- Description: 40S RIBOSOMAL PROTEIN S3A (S1A) 

>gi I 7 08 51 | pir | | R3XL3A ribosomal protein S3a - African clawed 
25 frog >gi | 65 0 9 1 | emb I CAA4 0 5 92 | (X57322) ribosomal protein Sla 
[Xenopus laevis] 

- % Identity: 80.1 

- Alignment Length: 231 

- Location of Alignment in SEQ ID NO 314: from 1 to 

30 185 

Maximum Length Sequence: 

related to: 
Clone IDs: 
35 3996 
2749 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 315 

- Ceres seq_id 1663275 

40 - Alternative transcription start site(s) located in SEQ 

ID NO 315: 

3,4,5,6,7,8,9,15 

- Clone 3996 starts at 1 and ends at 484 in cDNA. 

- Clone 2749 starts at 7 and ends at 488 in cDNA. 

45 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 316 

- Ceres seq_id 1663276 

- Location of start within SEQ ID NO 315: at 75 nt . 

50 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein Sl4p/S29e 

- Location within SEQ ID NO 316: from 3 to 54 aa. 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 206 

- gi No. 4506717 

- Description: ref | NP_001023 . 1 | pRPS29 | ribosomal 
10 protein S29 >gi | 266972 I sp | P30054 | RS29_HUMAN 40S RIBOSOMAL 

PROTEIN S29 >gi | 6318 8 4 I pir | | S302 98 ribosomal protein S29 - 
rat >gi | 1362934 | pir | I S55919 ribosomal protein S29 - human 
protein S29 [Homo sapiens] 

- % Identity: 72.2 

15 - Alignment Length: 54 

- Location of Alignment in SEQ ID NO 316: from 1 to 

54 

(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 317 

- Ceres seq_id 1663277 

- Location of start within SEQ ID NO 315: at 92 nt . 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

Maximum Length Sequence: 
related to: 
30 Clone IDs: 

114940 
25068 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NC 318 
35 - Ceres seq_id 1665304 

- Alternative transcription start site(s) located in SEQ 
ID NO 318: 

-8,-3, -1,2, 3,4,8, 17, 251 

- Clone 114940 starts at 1 and ends at 743 in cDNA. 
40 - Clone 25068 starts at 4 and ends at 760 in cDNA. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 319 

- Ceres seq__id 1665305 

45 - Location of start within SEQ ID NO 318: at 63 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
50 - Alignment No. 207 

- gi No. 2129641 
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- Description: major latex protein type 1 - 
Arabidopsis thaliana >gi I 1 1 07 4 93 | emb I CAA6302 6 | (X91960) major 
latex protein typel [Arabidopsis thaliana] 

- % Identity: 71.3 

5 - Alignment Length: 150 

- Location of Alignment in SEQ ID NO 319: from 1 to 

150 

Maximum Length Sequence: 
10 related to: 
Clone IDs: 
94673 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 320 
15 - Ceres seq_id 1709970 

- Alternative transcription start site(s) located in SEQ 
ID NO 320: 

-2,-1,11,41,42,43,44,45,46,47,48,51 

20 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 321 

- Ceres seq_id 1709971 

- Location of start within SEQ ID NO 320: at 276 nt . 

25 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 208 

- gi No. 2961300 

30 - Description: (AJ225027) ribosomal protein L24 

[Cicer arietinum] 

- % Identity: 86.2 

- Alignment Length: 160 

- Location of Alignment in SEQ ID NO 321: from 1 to 

35 107 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 322 

- Ceres seq_id 1709972 

40 - Location of start within SEQ ID NO 320: at 424 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

4 5 

Maximum Length Sequence: 

related to: 
Clone IDs: 

112835 

50 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 323 
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- Ceres seq_id 1711273 

- Alternative transcription start site(s) located in SEQ 
ID NO 323: 

2, 28, 30, 33, 56, 62, 235 

5 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 324 

- Ceres seq_id 1711274 

- Location of start within SEQ ID NO 323: at 92 nt . 

10 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Helix- turn-helix 

- Location within SEQ ID NO 324: from 87 to 141 aa . 

15 

(Dp) Related Amino Acid Sequences 

- Alignment No. 209 

- gi No. 1632831 

- Description: (Z49698) orf [Ricinus communis] 
20 - % Identity: 81 

- Alignment Length: 142 

- Location of Alignment in SEQ ID NO 324: from 1 to 

142 

25 Maximum Length Sequence: 
related to: 
Clone IDs: 
17878 

(Ac) cDNA Polynucleotide Sequence 
30 - Pat. Appln. SEQ ID NO 325 

- Ceres seq_id 1715423 

- Alternative transcription start site(s) located in SEQ 
ID NO 325: 

2, 4, 7, 66 

35 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 326 

- Ceres seq_id 1715424 

- Location of start within SEQ ID NO 325: at 25 nt . 

40 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 210 
45 - gi No. 4337175 

- Description: (AC006416) ESTs gb|T20589, 

gb | TO 4 64 8, gb | AA597 90 6, gb | TO 4 ill, gb | R8 4180, gb | R65 428 , 
gb | T44439, gb | T76570, gb | R90004, gb | T45020, gb | T42457, 
gb|T20921, gb|AA042762 and gb|AA720210 come from this gene. 
50 [Arabidopsis thaliana] 

- % Identity: 44.8 
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- Alignment Length: 67 

- Location of Alignment in SEQ ID NO 326: from 15 



to 81 



5 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 327 

- Ceres seq_id 1715425 

- Location of start within SEQ ID NO 325: at 40 nt . 

10 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 211 

- gi No. 4337175 

15 - Description: (AC006416) ESTs gb)T20589, 

gb | TO 4 64 8, gb | AA5 97 90 6, gb | T04111, gb|R8 4180, gb I R65428, 
gb | T4 4 439, gb I T7 657 0, gb I R90 00 4, gb | T 4 502 0, gb I T4 24 57, 
gb|T20921, gb|AA042762 and gb|AA72021C come from this gene. 
[Arabidopsis thai i ana ] 

20 - % Identity: 44.8 

- Alignment Length: 67 

- Location of Alignment in SEQ ID NO 327: from 10 

to 76 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 328 

- Ceres seq_id 1715426 

- Location of start within SEQ ID NO 325: at 260 nt . 

30 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 212 

- gi No. 4337175 

35 - Description: (AC006416) ESTs gb|T20589, 

gb | TO 4 64 8, gb | AA5 97 90 6, gb | T04111, gb I R84180, gb I R65428, 
gb | T44439, gb | T7 657 0, gb 1 R90004, gb J T 4 5020, gb | T42457, 
gb|T20921, gb|AA042762 and gb|AA720210 come from this gene. 
[Arabidopsis t ha liana] 

40 - % Identity: 56.1 

- Alignment Length: 57 

- Location of Alignment in SEQ ID NO 328: from 5 to 

61 

45 (Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

Maximum Length Sequence: 
related to: 
50 Clone IDs: 
26744 
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(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 329 

- Ceres seq_id 1715962 

- Alternative transcription start site(s) located in SEQ 
5 ID NO 329: 

2,7, 9,10,26, 28,30, 31, 32, 38,39, 42 ,43, 47,48, 194 ,247 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 330 
10 - Ceres seq_id 1715963 

- Location of start within SEQ ID NO 329: at 87 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 - Ribosomal protein L6 

- Location within SEQ ID NO 330: from 12 to 194 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment: No. 213 
20 - gi No. 1710581 

- Description: 60S RIBOSOMAL PROTEIN L9 

>gi I 2129720 | pir | | S71255 ribosomal protein L9 - Arabidopsis 
thaliana >gi | 1107489 | emb I CAA63024 | (X91958) 60S ribosomal 
protein L9 [Arabidopsis thaliana] 
25 - % Identity: 93.3 

- Alignment Length: 195 

- Location of Alignment in SEQ ID NO 330: from 1 to 

194 

30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 331 

- Ceres seq_id 1715964 

- Location of start within SEQ ID NO 329: at 114 nt . 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Ribosomal protein L6 

- Location within SEQ ID NO 331: from 3 to 185 aa. 

40 (Dp) Related Amino Acid Sequences 

- Alignment No. 214 

- gi No. 1710581 

- Description: 60S RIBOSOMAL PROTEIN L9 

>gi I 2129720 | pir | | S71255 ribosomal protein L9 - Arabidopsis 
45 thaliana >gi | 1107489 I emb I CAA63024 | (X91958) 60S ribosomal 
protein L9 [Arabidopsis thaliana] 

- % Identity: 93.3 

- Alignment Length: 195 

- Location of Alignment in SEQ ID NO 331: from 1 to 

50 185 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 332 

- Ceres seq__id 1715965 

- Location of start within SEQ ID NO 329: at 369 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L6 

- Location within SEQ ID NO 332: from 1 to 100 aa, 

(Dp) Related Amino Acid Sequences 

- Alignment No. 215 

- gi No. 1710581 

- Description: 60S RIBOSOMAL PROTEIN L9 



>gi I 
thai 
prot 



100 



2129720 |pir | | S71255 ribosomal 
iana >gi I 1107489 I emb I CAA63024 
ein L9 [Arabidopsis thaliana] 

- % Identity: 93.3 

- Alignment Length: 195 

- Location of Alignment 



protein L9 - Arabidopsis 
(X91958) 60S ribosomal 



in SEQ ID NO 332: from 



to 



Maximum Length Sequence; 

related to: 
Clone IDs: 

41712 
Public Genomic DNA: 
gi No: 4512656 
Predicted Exons: 
INIT 106774. 
INTR 106277. 
INTR 105920. 
TERM 105749. 



INTR 
INTR 
INTR 
INTR 

INIT 
INTR 
INTR 
TERM 
No 



Predicted 
INIT 
INTR 
INTR 
TERM 



106841 . 
106277 , 
105920 , 
105749, 

106774 . 
106277 , 
105920 
105749. 
4559319 



Exons : 

10855 

10358 

10001 

9830 



106633 
106194 
105838 
105630 

106633 
106194 
105838 
105453 

106633 
106194 
105838 
105630 



10714 
10275 
9919 
9711 



GENBANK 
GENBANK 
GENBANK 
GENBANK 

OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



GENBANK 
GENBANK 
GENBANK 
GENBANK 



INTR 



10922 



10714 



OCKHAMG-CDNA 
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INTR 


10358 . . . 


10275 


OCKHAMG- 


■CDNA 


INTR 


10001 ... 


9919 


OCKHAMG- 


•CDNA 


INTR 


9830 


9534 


OCKHAMG- 


•CDNA 


INIT 


10855 . . . 


10714 


OCKHAMG- 


■CDS 


INTR 


10358 . . . 


10275 


OCKHAMG- 


■CDS 


INTR 


10001 . . . 


9919 


OCKHAMG- 


•CDS 


TERM 


9830 . . . 


9711 


OCKHAMG- 


•CDS 



(Ac) cDNA Polynucleotide Sequence 
10 - Pat. Appln. SEQ ID NO 333 

- Ceres seq_id 1808584 

- Alternative transcription start site(s) located in SEQ 
ID NO 333: 

34 

15 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 334 

- Ceres seq_id 1808585 

- Location of start within SEQ ID NO 333: at 1 nt . 

20 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Helix -turn- helix 

- Location within SEQ ID NO 334: from 110 to 164 

25 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 216 

- gi No. 1632831 

30 - Description: (Z49698) orf [Ricinus communis] 

- % Identity: 81 

- Alignment Length: 142 

- Location of Alignment in SEQ ID NO 334: from 24 

to 165 

35 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 335 

- Ceres seq_id 1808586 

- Location of start within SEQ ID NO 333: at 70 nt . 

40 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Helix- turn-helix 

- Location within SEQ ID NO 335: from 87 to 141 aa. 

45 

(Dp) Related Amino Acid Sequences 

- Alignment No. 217 

- gi No. 1632831 

- Description: (Z49698) orf [Ricinus communis] 
50 - % Identity: 81 

- Alignment Length: 142 
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- Location of Alignment in SEQ ID NO 335: from 1 to 

142 

Maximum Length Sequence: 
5 related to: 
Clone IDs: 

42211 
Public Genomic DNA: 
gi No: 4914400 
10 Predicted Exons : 

INTR 90159 ... 89532 

SINGLE 90124 . . . 89615 
gi No: 4938473 
15 Predicted Exons: 

SINGLE 2368 . . . 1859 

INTR 2403 ... 1776 

20 SINGLE 2368 . . . 1859 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 336 

- Ceres seq_id 1808591 

- Alternative transcription starr site(s) located in SEQ 
25 ID NO 336: 

7, 10 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 337 
30 - Ceres seq__id 1808592 

- Location of start within SEQ ID NO 336: at 2 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 337 : at 36 aa. 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 218 

- gi No. 4926823 

40 - Description: (AC004135) TI7H7.8 [Arabidopsis 

thaliana ] 

- % Identity: 46.1 

- Alignment Length: 17 8 

- Location of Alignment in SEQ ID NO 337: from 13 

45 to 177 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 338 

- Ceres seq__id 1808593 

50 - Location of start within SEQ ID NO 336: at 38 nt . 



OCKHAMG-CDNA 
OCKHAMG-CDS 

GENBANK 

OCKHAMG-CDNA 

OCKHAMG-CDS 
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- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 338 : at 24 aa. 



(C) Nomination and Annotation of Domains within 
5 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 219 

- gi No. 4926823 

- Description: (AC004135) T17H7.8 [Arabidopsis 

10 thaliana] 

- % Identity: 46.1 

- Alignment Length: 178 

- Location of Alignment in SEQ ID NO 338: from 1 to 

165 

15 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 339 

- Ceres seq_id 1808594 

- Location of start within SEQ ID NO 336: at 164 nc. 

20 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 220 
25 - gi No. 4926823 

- Description: (AC004135) T17H7.8 [Arabidopsis 

thaliana] 

- % Identity: 46.1 

- Alignment Length: 178 

30 - Location of Alignment in SEQ ID NO 339: from 1 to 

123 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot specific gene, plant specific gene. 

35 

Maximum Length Sequence: 

related to: 
Clone IDs: 
2217 

40 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 340 

- Ceres seq_id 1920563 

- Alternative transcription start site(s) located in SEQ 
ID NO 340: 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 341 

- Ceres seq_id 1920564 

50 - Location of start within SEQ ID NO 340: at 1 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Oxidoreductase FAD/NAD-binding domain 

- Location within SEQ ID NO 341: from 45 to 167 aa. 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 221 

- gi No. 729477 

- Description: FERREDOXIN — NADP REDUCTASE PRECURSOR 
10 (FNR) >gi | 320548 | pir | | A44974 ferredoxin — NADP+ reductase (EC 

1.18.1.2) precursor - common ice plant >gi|167256 (M25528) 
f erredoxin-NADP+ reductase precursor (fnrA; EC 1.6.7.1) 
[Mesembryanthemum 

- % Identity: 89.8 

15 - Alignment Length: 197 

- Location of Alignment in SEQ ID NO 341: from 1 to 

197 

(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 342 

- Ceres seq__id 1920565 

- Location of start within SEQ ID NO 340: at 112 nt . 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

- Oxidoreductase FAD/NAD-binding domain 

- Location within SEQ ID NO 342: from 8 to 130 aa. 

(Dp) Related Amino Acid Sequences 
30 - Alignment No. 222 

- gi No. 729477 

- Description: FERREDOXIN — NADP REDUCTASE PRECURSOR 
(FNR) >gi | 320548 Ipir MA44974 ferredoxin — NADP+ reductase (EC 
1.18.1.2) precursor - common ice plant >gi | 167256 (M25528) 

35 f erredoxin-NADP+ reductase precursor (fnrA; EC 1.6.7.1) 
[Mesembryanthemum 

- % Identity: 89.8 

- Alignment Length: 197 

- Location of Alignment in SEQ ID NO 342: from 1 to 

40 160 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 343 

- Ceres seq__id 1920566 

45 - Location of start within SEQ ID NO 340: at 118 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Oxidoreductase FAD/NAD-binding domain 

50 - Location within SEQ ID NO 343: from 6 to 128 aa. 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 223 

- gi No. 729477 

- Description: FERREDOXIN — NADP REDUCTASE PRECURSOR 
(FNR) >gi | 320548 | pir M A44974 ferredoxin — NADP+ reductase (EC 
1.18.1.2) precursor - common ice plant >gi|167256 (M25528) 

f erredoxin-NADP+ reductase precursor (fnrA; EC 1.6.7.1) 
[Mesembryanthemum 

- % Identity: 89.8 

- Alignment Length: 197 

- Location of Alignment in SEQ ID NO 343: from 1 to 

158 



Maximum Length Sequence: 

related to: 
Clone IDs: 
13864 

Public Genomic DMA : 
gi No: 4309719 
Predicted Exons : 



INIT 


50987 . . . 


50773 


GENBANK 


INTR 


49622 . . . 


49352 


GENBANK 


INTR 


49257 . . . 


49162 


GENBANK 


INTR 


48896 ... 


48786 


GENBANK 


INTR 


48686 . . . 


48377 


GENBANK 


INTR 


48133 ... 


48043 


GENBANK 


INTR 


47924 . . . 


47591 


GENBANK 


INTR 


47475 . . . 


47327 


GENBANK 


TERM 


47084 ... 


46550 


GENBANK 


Polynucleotide 


Sequence 





- Pat. Appln. SEQ ID NO 344 

- Ceres seq_id 1974419 

- Alternative transcription start site(s) located in SEQ 
ID NO 34 4: 

-5 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 345 

- Ceres seq_id 1974420 

- Location of start within SEQ ID NO 344: at 81 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- ATPases associated with various cellular 
activities (AAA) 

- Location within SEQ ID NO 345: from 454 to 664 

aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 224 

- gi No. 4006905 



WO 00/40695 



PCT/US00/00466 



223 

- Description: (Z99708) ATPase-like protein 
[ Arabidopsis t ha liana] 

- % Identity: 84.3 

- Alignment Length: 633 

5 - Location of Alignment in SEQ ID NO 345: from 72 

to 700 

(B) Polypeptide Sequence 

-.Pat. Appln. SEQ ID NO 346 
10 - Ceres seq_id 1974421 

- Location of start within SEQ ID NO 344: at 96 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 - ATPases associated with various cellular 

activities (AAA) 

- Location within SEQ ID NO 346: from 449 to 659 

aa . 

20 (Dp) Related Amino Acid Sequences 

- Alignment No. 225 

- gi No. 4006905 

- Description: (Z99708) ATPase-like protein 
[Arabidopsis t ha liana] 

25 - % Identity: 84.3 

- Alignment Length: 633 

- Location of Alignment in SEQ ID NO 346: from 67 

to 695 

30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 347 

- Ceres seq_id 1974422 

- Location of start within SEQ ID NO 344: at 141 nt . 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- ATPases associated with various cellular 
activities (AAA) 

- Location within SEQ ID NO 347: from 434 to 644 

40 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 226 

- gi No. 4006905 

45 - Description: (Z99708) ATPase-like protein 

[Arabidopsis thaliana] 

- % Identity: 84.3 

- Alignment Length: 633 

- Location of Alignment in SEQ ID NO 347: from 52 

50 to 680 
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Maximum Length Sequence: 

related to: 
Clone IDs: 
36333 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 348 

- Ceres seq id 1975983 



10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 349 

- Ceres seq_id 1975984 

- Location of start within SEQ ID NO 348: at 90 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 227 

- gi No. 3688432 

20 - Description: (AJ011705) 40S ribosomal protein S10 

[Lumbricus rubellus ] 

- % Identity: 55.3 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 34 9: from 32 

25 to 192 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 350 

- Ceres seq_id 1975985 

30 - Location of start within SEQ ID NO 348: at 135 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 228 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 55.3 

40 - Alignment Length: 164 

- Location of Alignment in SEQ ID NO 350: from 17 

to 177 

(B) Polypeptide Sequence 
45 - Pat. Appln. SEQ ID NO 351 

- Ceres seq_id 1975986 

- Location of start within SEQ ID NO 348: at 183 nt . 

(C) Nomination and Annotation of Domains within 
50 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
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- Alignment No. 229 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rube 11 us ] 

5 - % Identity: 55.3 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 351: from 1 to 161 

(Ba) Polypeptide Activities: Similar to glycine-rich RNA 
10 binding ABA inducible protein activities. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
15 24587 
41072 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 352 

- Ceres seq_id 1976019 

20 - Alternative transcription start site(s) located in SEQ 

ID NO 352: 
13 

- Clone 24587 starts at 13 and ends at in cDNA. 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 353 

- Ceres seq_id 1976020 

- Location of start within SEQ ID NO 352: at 101 nt . 

30 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 230 

- gi No. 3688432 

35 - Description: (AJ011705) 40S ribosomal protein S10 

[Lumbricus rubellus ] 

- % Identity: 55.3 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 353: from 1 to 

40 161 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 354 

- Ceres seq_id 1976021 

45 - Location of start within SEQ ID NO 352: at 239 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
50 - Alignment No. 231 

- gi No. 3688432 
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- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 55.3 

- Alignment Length: 164 

5 - Location of Alignment in SEQ ID NO 354: from 1 to 

115 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 35 5 
10 - Ceres seq_id 1976022 

- Location of start within SEQ ID NO 352: at 287 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 (Dp) Related Amino Acid Sequences 

- Alignment No. 232 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

20 - % Identity: 55.3 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 355: from 1 to 

99 

25 (Ba) Polypeptide Activities: Extensin like protein 
activities and glycine rich protein activities. 

Maximum Length Sequence: 
related to: 
30 Clone IDs: 

117263 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 356 

- Ceres seq_id 1976673 

35 - Alternative transcription start site(s) located in SEQ 

ID NO 356: 

-46,2, 3, 6, 7 

(B) Polypeptide Sequence 
40 - Pat. Appln. SEQ ID NO 357 

- Ceres seq_id 1976674 

- Location of start within SEQ ID NO 356: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 357: at 34 aa. 

45 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S21 

- Location within SEQ ID NO 357: from 106 to 155 

50 aa . 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 233 

- gi No. 1710750 

- Description: 30S RIBOSOMAL PROTEIN S21 (BS-B) 
>gi I 1303814 | dbj | BAA12470 | (D84432) YqeX [Bacillus subtilis] 
>gi I 1890063 | dbj | BAA12082 | (D83717) YqeX [Bacillus subtilis] 
>gi I 2634987 | emb | CAB14483 I (Z99117) ribosomal protein S21 
[Bacillus subtilis ] 

- % Identity: 38.5 

- Alignment Length: 52 

- Location of Alignment in SEQ ID NO 357: from 106 

to 157 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 358 

- Ceres seq_id 1976675 

- Location of start within SEQ ID NO 356: at 48 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 358 : ar 19 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S21 

- Location within SEQ ID NO 358: from 91 to 140 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 234 

- gi No. 1710750 

- Description: 30S RIBOSOMAL PROTEIN S21 (BS-B) 
>gi I 1303814 | dbj | BAA12470 | (D84432) YqeX [Bacillus subtilis] 
>gi I 1890063 | dbj | BAA12082 | (D83717) YqeX [Bacillus subtilis] 
>gi I 2634987 | emb I CAB14483 | (Z99117) ribosomal protein S21 
[Bacillus subtilis ] 

- % Identity: 38.5 

- Alignment Length: 52 

- Location of Alignment in SEQ ID NO 358: from 91 

to 142 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 359 

- Ceres seq_id 1976676 

- Location of start within SEQ ID NO 356: at 192 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S21 

- Location within SEQ ID NO 359: from 43 to 92 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 235 

- gi No. 1710750 
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- Description: 30S RIBOSOMAL PROTEIN S21 (BS-B) 
>gi I 1303814 Idbj | BAA12470 | (D84432) YqeX [Bacillus subtilis] 
>gi I 1890063 | dbj | BAA12082 | (D83717) YqeX [Bacillus subtilis] 
>gi I 2634987 | emb | CAB14483 I (Z99117) ribosomal protein S21 
[Bacillus subtilis] 

- % Identity: 38.5 

- Alignment Length: 52 

- Location of Alignment in SEQ ID NO 359: from 43 

to 94 



Maximum Length Sequence: 

related to: 
Clone IDs: 

42333 
Public Genomic DNA: 
gi No: 6041810 
Predicted Exons: 

INTR 93582 . . . 94052 OCKHAMG-CDNA 
INTR 94169 ... 94771 OCKHAMG-CDNA 



INIT 93630 
TERM 94169 
gi No: 6091711 
Predicted Exons: 
INTR 60479 
INTR 61066 



94052 OCKHAMG-CDS 
94687 OCKHAMG-CDS 



60 94 9 OCKHAMG-CDNA 
61668 OCKHAMG-CDNA 



INIT 60527 ... 

TERM 61066 . . . 
gi No: 6102641 
Predicted Exons: 

INTR 52165 . . . 

INTR 52752 . . . 

INIT 52213 . . . 

TERM 52752 . . . 
gi No: 6453849 
Predicted Exons: 

INTR 59432 . . . 

INTR 58847 . . . 



60949 OCKHAMG-CDS 
61584 OCKHAMG-CDS 



52 63 5 OCKHAMG-CDNA 

53 35 4 OCKHAMG-CDNA 

52635 OCKHAMG-CDS 

53270 OCKHAMG-CDS 



58 964 OCKHAMG-CDNA 
582 4 3 OCKHAMG-C DNA 



OCKHAMG-CDS 
OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 360 

- Ceres seq_id 2025186 



INIT 59386 . . . 58964 
TERM 58847 . . . 58329 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 361 

- Ceres seq_id 2025187 
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- Location of start within SEQ ID NO 360: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
5 - D-isomer specific 2-hydroxyacid dehydrogenases 

- Location within SEQ ID NO 361: from 93 to 295 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 236 
10 - gi No. 5458864 

- Description: (AJ248287) GLYCERATE DEHYDROGENASE 
[ Pyro coccus abyssi ] 

- % Identity: 42.8 

- Alignment Length: 299 

15 - Location of Alignment in SEQ ID NO 361: from 36 

to 322 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 362 
20 - Ceres seq_id 2025188 

- Location of start within SEQ ID NO 360: at 49 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
25 - D-isomer specific 2-hydroxyacid dehydrogenases 

- Location within SEQ ID NO 362: from 77 to 279 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 237 
30 - gi No. 5458864 

- Description: (AJ248287) GLYCERATE DEHYDROGENASE 
[ Pyro co ecus abyssi ] 

- % Identity: 42.8 

- Alignment Length: 299 

35 - Location of Alignment in SEQ ID NO 362: from 20 

to 306 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 363 
40 - Ceres seq_id 2025189 

- Location of start within SEQ ID NO 360: at 70 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
45 - D-isomer specific 2-hydroxyacid dehydrogenases 

- Location within SEQ ID NO 363: from 70 to 272 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 238 
50 - gi No. 5458864 
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- Description: (AJ248287) 
[ Pyrococcus abyssi ] 

- % Identity: 42.8 

- Alignment Length: 299 

- Location of Alignment 

to 299 



GLYCERATE DEHYDROGENASE 



in SEQ ID NO 363: from 13 



Maximum Lengrh Sequence: 
related to : 
10 Clone IDs: 

267004 
Public Genomic DNA: 
gi No: 2924730 
Predicted Exons : 
15 INTR 8331 



8882 



OCKHAMG-CDNA 



SINGLE 8366 ... 8686 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 364 
20 - Ceres seq_id 2025372 

- Alternative transcription start site(s) 
ID NO 364 : 

21 



located in SEQ 



25 



30 



35 



40 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 365 

- Ceres seq_id 2025373 

- Location of start within SEQ 



ID NO 364: at 1 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 239 

- gi No. 132825 

- Description: 50S RIBOSOMAL PROTEIN CL25, 
CHLOROPLAST PRECURSOR >gi I 7 1308 | pir | | R5PM25 ribosomal protein 
PsCL25 precursor, chloroplast - garden pea 

>gi|20877| emb I CAA32187 | (X14022) PsCL25 ribosomal preprotein 
(AA -30 to 74) [Pisum sativum] 

- % Identity: 56.1 

- Alignment Length: 82 

- Location of Alignment in SEQ ID NO 365: from 14 

to 95 



45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 366 

- Ceres seq_id 2025374 

- Location of start within SEQ ID NO 364: at 37 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
50 ID NO 366: at 16 aa. 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 240 

- gi No. 132825 

- Description: 50S RIBOSOMAL PROTEIN CL25, 
CHLOROPLAST PRECURSOR >gi I 7 130 8 | pir | | R5 PM2 5 ribosomal protein 
PsCL25 precursor, chloroplast - garden pea 

>gi I 2 0877 | emb | CAA32187 | (X14022) PsCL25 ribosomal preprotein 
( AA -30 to 74) [Pisum sativum] 

- % Identity: 56.1 

- Alignment Length: 82 

- Location of Alignment in SEQ ID NO 366: from 2 to 

83 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 367 

- Ceres seq_id 2025375 

- Location of start within SEQ ID NO 364: at 142 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 241 

- gi No. 132825 

- Description: 50S RIBOSOMAL PROTEIN CL25, 
CHLOROPLAST PRECURSOR >gi | 7 13 0 8 | pir | | R5 PM2 5 ribosomal protein 
PsCL25 precursor, chloroplast - garden pea 

>gi|20877| emb | CAA32187 | (X14022) PsCL25 ribosomal preprotein 
(AA -30 to 74) [Pisum sativum] 

- % Identity: 56.1 

- Alignment Length: 82 

- Location of Alignment in SEQ ID NO 367: from 1 to 

48 



Maximum Length Sequence: 

related to: 
Clone IDs: 

26930 
Public Genomic DNA: 
gi No: 5225383 
Predicted Exons : 

SINGLE 64539 ... 64276 GENBANK 



INIT 64787 . 

TERM 64620 . 
gi No: 6136349 
Predicted Exons: 

INIT 126797. 

TERM 126964. 



64692 
64276 



126892 
127308 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 
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- Pat. Appln. SEQ ID NO 368 

- Ceres seq_id 2025471 

- Alternative transcription start site(s) located in SEQ 
ID NO 368 : 

5 95,96,97,98,100,101,102,105,106,110 

- Clone 26930 starts at 97 and ends at 631 in cDNA. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 369 
10 - Ceres seq_id 2025472 

- Location of start within SEQ ID NO 368: at 1 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 (Dp) Related Amino Acid Sequences 

- Alignment No. 242 

- gi No. 5732082 

- Description: (AF162444) contains similarity to 
Solanum iycopersicum (tomato) wound induced protein 

20 (GB:X59882) [Arabidopsis thaliana] 

- % Identity: 100 

- Alignment Length: 87 

- Location of Alignment in SEQ ID NO 369: from 60 

to 146 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 370 

- Ceres seq_id 20254 . 3 

- Location of start within SEQ ID NO 368: at 178 ni: . 

30 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 243 
35 - gi No. 5732082 

- Description: (AF162444) contains similarity to 
Solanum Iycopersicum (tomato) wound induced protein 
(GB:X59882) [Arabidopsis thaliana] 

- % Identity: 100 

4 0 - Alignment Length: 87 

- Location of Alignment in SEQ ID NO 370: from 1 to 

87 



(B) Polypeptide Sequence 
45 - Pat. Appln. SEQ ID NO 371 

- Ceres seq__id 2025474 

- Location of start within SEQ ID NO 368: at 214 nt . 



(C) Nomination and 
50 Predicted Polypeptide ( s ) 
(Dp) Related Amino 



Annotation of Domains within 
Acid Sequences 
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- Alignment No. 244 

- gi No. 5732082 

- Description: (AF162444) contains similarity to 
Solanum lycopersicum (tomato) wound induced protein 

5 (GB:X59882) [Arabidopsis thaliana] 

- % Identity: 100 

- Alignment Length: 87 

- Location of Alignment in SEQ ID NO 371: from 1 to 

75 

10 

Maximum Length Sequence: 

related to: 
Clone IDs: 
27681 

15 Public Genomic DNA: 
gi No: 6434227 
Predicted Exons : 



20 



INTR 


51354 . . 


51417 


OCKHAMG- 


-CDNA 


INTR 


51529 . . 


51786 


OCKHAMG- 


•CDNA 


INIT 


47952 . . 


. 48020 


OCKHAMG- 


■CDS 


INTR 


48144 .. 


48440 


OCKHAMG- 


•CDS 


INTR 


48542 . . 


48757 


OCKHAMG- 


•CDS 


INTR 


489C8 . . . 


49027 


OCKHAMG- 


CDS 


INTR 


49117 . . 


49654 


OCKHAMG- 


•CDS 


TERM 


50075 .. 


50184 


OCKHAMG- 


•CDS 


Polynucleotide 


Sequence 







- Pat. Appln. SEQ ID NO 372 

- Ceres seq_id 2025475 

30 - Alternative transcription start site(s) located in SEQ 

ID NO 372: 

1242, 12 51, 12 70, 1271, 127 5, 12 7 6 , 12 8 0 , 128 7 , 128 8 , 12 90 , 12 92 , 12 93 , 1 
301, 13 02, 1305, 1307, 1314, 1318, 3 321, 132 5 
35 1326,1330 

- Clone 27681 starts at 1242 and ends at 2029 in cDNA . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 373 
40 - Ceres seq_id 2025476 

- Location of start within SEQ ID NO 372: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
45 (Dp) Related Amino Acid Sequences 

- Alignment No. 245 

- gi No. 2739366 

- Description: (AC002505) SF16 like protein 
[Arabidopsis thaliana ] 

50 - % Identity: 45 

- Alignment Length: 373 
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- Location of Alignment in SEQ ID NO 373: from 61 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 374 

- Ceres seq_id 2025477 

- Location of start within SEQ ID NO 372: at 316 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 24 6 

- gi No. 2739366 

- Description: (AC002505) SF16 like protein 
[ Arabidopsis t ha liana] 



- % Identity: 45 

- Alignment Length: 373 

- Location of Alignment in SEQ ID NO 374: from 1 to 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 375 

- Ceres seq_id 2025478 

- Location of start within SEQ ID NO 372: at 364 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 375: at 21 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 247 

- gi No. 2739366 

- Description: (AC002505) SF16 like protein 
[Arabidopsis thaliana] 



- % Identity: 45 

- Alignment Length: 373 

- Location of Alignment in SEQ ID NO 375: from 1 to 



Maximum Length Sequence: 
related to: 

Clone IDs: 
14564 

Public Genomic DNA: 
gi No: 4415905 
Predicted Exons : 



to 418 



313 



297 



INTR 
INTR 
INTR 
INTR 
INTR 



3235 
2694 
2509 
2319 
2076 



3181 
2619 
2408 
2171 
1784 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
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INIT 


3191 . . 


3181 


OCKHAMG-CDS 




INTR 


2694 


. 2619 


OCKHAMG-CDS 




INTR 


2509 


2408 


OCKHAMG-CDS 


5 


INTR 


2319 . . 


2171 


OCKHAMG-CDS 




TERM 


2076 


1920 


OCKHAMG-CDS 




gi No: 4 5 E 


51138 








Predicted 


Exons : 








INTR 


100911. . . 


100857 


OCKHAMG-CDNA 


10 


INTR 


100370 . . . 


100295 


OCKHAMG-CDNA 




INTR 


100185. . 


100084 


OCKHAMG-CDNA 




INTR 


99995 . . . 


99847 


OCKHAMG-CDNA 




INTR 


99752 . . 


99460 


OCKHAMG-CDNA 


15 


INIT 


100867 . . . 


100857 


OCKHAMG-CDS 




INTR 


100370. . . 


100295 


OCKHAMG-CDS 




INTR 


100185. . . 


100084 


OCKHAMG-CDS 




INTR 


99995 . . . 


99847 


OCKHAMG-CDS 




TERM 


99752 . . . 


99596 


OCKHAMG-CDS 


20 


(Ac) cDNA Polynucleotide 


Sequence 





- Pat. Appln. SEQ ID NO 376 

- Ceres seq_id 2025524 

- Alternative transcription start site(s) located in SEQ 
ID NO 376: 

25 -35,-2 6,-25,-2,2,3,4,5,6,7,8,9,15,16,17,32 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 377 

- Ceres seq__id 2025525 

30 - Location of start within SEQ ID NO 376: at 47 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L24e 

35 - Location within SEQ ID NO 377: from 3 to 73 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 248 

- gi No. 6094040 

40 - Description: 60S RIBOSOMAL PROTEIN L24 

>gi I 2961300 | emb I CAA12358 | (AJ225027) ribosomal protein L24 
[Cicer arietinum] 

- % Identity: 88.3 

- Alignment Length: 163 

45 - Location of Alignment in SEQ ID NO 377: from 1 to 

163 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 378 
50 - Ceres seq__id 2025526 

- Location of start within SEQ ID NO 376: at 212 nt . 
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50 



(Dp) Related Amino Acid Sequences 

- Alignment No. 250 

- gi No. 1708292 
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- Description: HEAT-RESPONSIVE PROTEIN 12 

>gi I 1255116 (U50631) heat-responsive protein [Mus musculus] 

- % Identity: 54.3 

- Alignment Length: 129 

- Location of Alignment in SEQ ID NO 380: from 119 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 381 

- Ceres seq_id 2025546 

- Location of start within SEQ ID NO 379: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 381: at 50 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Zinc finger, C3HC4 type (RING finger) 

- Locarion within SEQ ID NO 381: from 171 to 211 

aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 251 

- gi No. 2435518 

- Description: (AF024504) contains similarity to 
C3HC4-type zinc fingers [Arabidopsis thaliana] 

- % Identity: 31.4 

- Alignment Length: 221 

- Location of Alignment in SEQ ID NO 381: from 35 

to 211 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 382 

- Ceres seq_id 2025547 

- Location of start within SEQ ID NO 379: at 16 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 382: at 45 aa. 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Zinc finger, C3HC4 type (RING finger) 

- Location within SEQ ID NO 382: from 166 to 206 

aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 252 

- gi No. 2435518 

- Description: (AF024504) contains similarity to 
C3HC4-type zinc fingers [Arabidopsis thaliana] 

- % Identity: 31.4 

- Alignment Length: 221 
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- Location of Alignment in SEQ ID NO 382: from 30 

to 206 

Maximum Length Sequence: 
5 related to: 
Clone IDs: 
7802 

Public Genomic DNA: 
gi No: 441592C 
10 Predicted Exon^ : 

INTR 60672 . . . 61210 OCKHAMG-CDNA 

SINGLE 60712 . . . 61119 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 
15 - Pat. Appln. SEQ ID NO 383 

- Ceres seq_id 2026207 

- Alternative transcription start site(s) located in SEQ 
ID NO 383: 

2,3,4,5 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 384 

- Ceres seq_id 2026208 

- Location of start within SEQ ID NO 383: at 2 nt . 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ;s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 253 
30 - gi No. 4757974 

- Description: ref I NP__0 0 4 057.1| pCETNl | centrin, EF- 
hand protein, i >gi | 24 93 4 4 0 I sp | Q12 7 98 I CAT2_HUMAN CALTRACTIN , 
ISOFORM 2 (CENTRIN) >gi|414993 (U03270) centrin [Homo 
sapiens] 

35 - % Identity: 34.3 

- Alignment Length: 102 

- Location of Alignment in SEQ ID NO 384: from 19 

to 120 

40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 385 

- Ceres seq_id 2026209 

- Location of start within SEQ ID NO 383: at 41 nt . 

45 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 254 

- gi No. 4757974 

50 - Description: ref | NP_004 057 . 1 | pCETNl | centrin, EF- 

hand protein, 1 >gi | 2 4 934 4 0 | sp | Q127 98 | CAT2_HUMAN CALTRACTIN, 
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ISOFORM 2 (CENTRIN) >gi 1414993 (U03270) centrin [Homo 
sapiens ] 

- % Identity: 34.3 

- Alignment Length: 102 

- Location of Alignment in SEQ ID NO 385: from 6 to 

107 

(B) Polypeptide Sequence 

- Pat. ^ppln. SEQ ID NO 386 

- Ceres seq_id 2026210 

- Location of start within SEQ ID NO 383: at 62 nr. 

(C) Nomination and Annoration of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 255 

- gi No. 4757974 

- Description: ref I NP_004057 . 1 | pCETNl | centrin, EF- 
hand protein, 1 >gi | 24934 40 | sp | Q12798 | CAT2_HUMAN CALTRACTIN, 
ISOFORM 2 (CENTRIN) >gi| 414993 (U03270) centrin [Homo 
sapiens ] 

- % Identity: 34.3 

- Alignment Length: 102 

- Location of Alignment in SEQ ID NO 386: from 1 to 

100 

Maximum Length Sequence: 
Public Genomic DNA: 

gi No: 4038029 

Predicted Exons : 



INIT 


46188 . . 


46125 


GENBANK 




TERM 


46021 .. 


45852 


GENBANK 




INTR 


46258 . . 


. 46125 


OCKHAMG- 


■CDNA 


INTR 


46021 .. 


45663 


OCKHAMG- 


•CDNA 


INIT 


47157 .. 


. 47119 


OCKHAMG- 


■CDS 


INTR 


46278 . . 


46125 


OCKHAMG- 


•CDS 


TERM 


4 6021 . . 


45852 


OCKHAMG- 


•CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 387 

- Ceres seq_id 2026982 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 388 

- Ceres seq_id 2026983 

- Location of start within SEQ ID NO 387: at 1 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 388: at 73 aa. 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Gamma-thionins family 

5 - Location within SEQ ID NO 388: from 7 4 to 120 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 256 

- gi No. 1209258 

10 - Description: (L31937) protease inhibitor II 

[Brassica rapa] 

- % Identity: 87 

- Alignment Length: 77 

- Location of Alignment in SEQ ID NO 388: from 44 

15 to 120 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 38 9 

- Ceres seq_id 2026984 

20 - Location of start within SEQ ID NO 387: at 34 nr. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 389: at 54 aa . 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

- Gamma-thionins family 

- Location within SEQ ID NO 389: from 63 to 109 aa . 

(Dp) Related Amino Acid Sequences 
30 - Alignment No. 257 

- gi No. 1209258 

- Description: (L31937) protease inhibitor II 
[Brassica rapa] 

- % Identity: 87 

35 - Alignment Length: 77 

- Location of Alignment in SEQ ID NO 389: from 33 

to 109 

(B) Polypeptide Sequence 
40 - Pat. Appln. SEQ ID NO 390 

- Ceres seq__id 2026985 

- Location of start within SEQ ID NO 387: at 130 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 390: at 22 aa. 



45 



50 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Gamma-thionins family 

- Location within SEQ ID NO 390: from 31 to 77 aa . 
(Dp) Related Amino Acid Sequences 
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- Alignment No. 258 

- gi No. 1209258 

- Description: (L31937) protease inhibitor II 
[Brassica rapa] 

5 - % Identity: 87 

- Alignment Length: 77 

- Location of Alignment in SEQ ID NO 390: from 1 to 

77 

10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 391 

- Ceres seq_id 2026986 

- Location of start within SEQ ID NO 387: at 30 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

Maximum Length Sequence: 
20 related to: 
Clone IDs: 

251127 
Public Genomic DNA: 
gi No: 4220640 
25 Predicted Exons : 



30 



INIT 


67833 . . . 


68135 


OCKHAMG 


-CDS 


INTR 


68253 . . . 


58366 


OCKHAMG 


-CDS 


INTR 


68457 . . . 


68552 


OCKHAMG- 


-CDS 


INTR 


68720 . . . 


68794 


OCKHAMG- 


-CDS 


INTR 


68898 ... 


68965 


OCKHAMG- 


-CDS 


INTR 


69056 . . . 


69066 


OCKHAMG 


-CDS 


INTR 


69422 . . . 


69624 


OCKHAMG 


-CDS 


INTR 


69858 . . . 


70202 


OCKHAMG 


-CDS 


INTR 


70315 . . . 


70428 


OCKHAMG 


-CDS 


INTR 


70516 . . . 


70611 


OCKHAMG 


-CDS 


INTR 


70923 . . . 


70997 


OCKHAMG 


-CDS 


INTR 


71103 . . . 


71173 


OCKHAMG 


-CDS 


TERM 


71737 . . . 


71749 


OCKHAMG 


-CDS 



(Ac) cDNA Polynucleotide Sequence 
40 - Pat. Appln. SEQ ID NO 392 

- Ceres seq_id 2027300 

- Alternative transcription start site(s) located in SEQ 
ID NO 392: 

38, 53, 55, 63, 82, 8 4, 85, 103, 160 

45 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 393 

- Ceres seq_id 2027301 

- Location of start within SEQ ID NO 392: at 177 nt . 

50 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 259 
5 - gi No. 5020103 

- Description: (AF148219) fibrillin [Nostoc 

PCC8009] 

- % Identity : 30.3 

- Alignment Length: 122 

10 - Location of Alignment in SEQ ID NO 393: from 75 

to 189 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 394 
15 - Ceres seq_id 2027302 

- Location of starr wirhin SEQ ID NO 392: at 213 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
20 (Dp) Related Amino Acid Sequences 

- Alignment No. 260 

- gi No. 5020103 

- Description: (AF148219) fibrillin [Nostoc 

PCC8009] 

25 - % Identity: 30.3 

- Alignment Length: 122 

- Location of Alignment in SEQ ID NO 394: from 63 

to 177 

30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 395 

- Ceres seq_id 2027303 

- Location of start within SEQ ID NO 392: at 252 nt . 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 261 

- gi No. 5020103 

40 - Description: (AF148219) fibrillin [Nostoc 



PCC8009] 



45 to 164 



- % Identity: 30.3 

- Alignment Length: 122 

- Location of Alignment in SEQ ID NO 395: from 50 



Maximum Length Sequence: 

related to: 
Clone IDs: 
50 271260 

Public Genomic DNA: 
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gi No: 6056186 

Predicted Exons : 

INIT 66600 . . . 66465 
TERM 66390 . . . 66083 

gi No: 6087919 

Predicted Exons: 

INIT 31657 . . . 31522 
TERM 31447 . . . 31140 

gi No: 6119514 

Predicted Exons: 

INIT 32265 . . . 32130 
TERM 32055 . . . 31748 

gi No: 6143857 

Predicted Exons: 

INIT 34633 . . . 34768 
TERM 34843 ... 35150 

gi No: 6175136 

Predicted Exons: 

INIT 34136 . . . 34271 
TERM 34346 . . . 34653 

gi No: 6223634 

Predicted Exons: 

INIT 34137 . . . 34272 
TERM 34347 . . . 34654 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 396 

- Ceres seq_id 2027375 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 397 

- Ceres seq_id 2027376 

- Location of start within SEQ ID NO 396: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Bacterial mutT protein 

- Location within SEQ ID NO 397: from 27 to 68 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 262 

- gi No. 2622420 

- Description: (AE000895) mutator MutT protein 
[Methanobacterium thermoautot rophicum] 

- % Identity: 42.3 

- Alignment Length: 98 

- Location of Alignment in SEQ ID NO 397: from 13 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 398 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
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- Ceres seq_id 2027377 

- Location of start within SEQ ID NO 396: at 17 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Bacterial mutT protein 

- Location within SEQ ID NO 398: from 22 to 63 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 2 63 
» gi No. 2622420 

- Description: (AE000895) mutator MutT protein 
[Methanobacterium thermoautotrophicum] 

- % Identity: 42.3 

- Alignment Length: 98 

- Location of Alignment in SEQ ID NO 398: from 8 to 

104 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 399 

- Ceres seq_id 2027378 

- Location of start within SEQ ID NO 396: at 188 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 264 

- gi No. 2622420 

- Description: (AE000895) mutator MutT protein 
[Methanobacterium thermoautotrophicum] 

- % Identity: 42.3 

- Alignment Length: 98 

- Location of Alignment in SEQ ID NO 399: from 1 to 

47 

Maximum Length Sequence: 
Public Genomic DNA: 



gi No: 5868932 



Predicted 


Exons : 








INIT 


99857 . . 


96712 


GENBANK 




INTR 


96123 . . 


95883 


GENBANK 




TERM 


95642 . . 


95199 


GENBANK 




INTR 


96097 . . 


. 95883 


OCKHAMG- 


•CDNA 


INTR 


95642 . . 


95047 


OCKHAMG- 


•CDNA 


INIT 


96035 . . 


95883 


OCKHAMG- 


•CDS 


TERM 


95642 . . 


95199 


OCKHAMG- 


•CDS 



(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 400 
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- Ceres seq_id 2028729 



(B) Polypeptide Sequence 
5 - Pat. Appln. SEQ ID NO 401 

- Ceres seq_id 2028730 

- Location of start within SEQ ID NO 400: at 1 nt . 

(C) Nomination and Annotation of Domains within 
10 Predicted Polypeptide ( s ) 

- C2 domain 

- Location within SEQ ID NO 401: from 671 to 762 

aa . 

15 (Dp) Related Amino Acid Sequences 

- Alignment No. 265 

- gi No. 3142295 

- Description: (AC002411) Strong similarity ro 
phosphoribosylanthranilate transferase gb|D86180 from Pisum 

20 sativum. [Arabidopsis thaliana] 

- % Identity: 45.6 

- Alignment Length: 8 65 

- Location of Alignment in SEQ ID NO 401: from 211 

to 1050 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 402 

- Ceres seq_id 2028731 

- Location of start within SEQ ID NO 400: at 199 nt . 

30 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- C2 domain 

- Location within SEQ ID NO 402: from 605 to 696 

35 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 266 

- gi No. 3142295 

40 - Description: (AC002411) Strong similarity to 

phosphoribosylanthranilate transferase gb|D86180 from Pisum 
sativum. [Arabidopsis thaliana] 

- % Identity: 45.6 

- Alignment Length: 8 65 

45 - Location of Alignment in SEQ ID NO 402: from 145 

to 984 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 403 
50 - Ceres seq_id 2028732 

- Location of start within SEQ ID NO 400: at 232 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- C2 domain 

5 - Location within SEQ ID NO 403: from 594 to 685 

aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 267 
10 - gi No. 3142295 

- Description: (AC002411) Strong similarity to 
phosphoribosylanthranilate transferase gb|D86180 from Pisum 
sativum. [Arabidopsis thaliana] 

- % Identity: 45.6 

15 - Alignment Length: 8 65 

- Location of Alignment in SEQ ID NO 403: from 134 

to 973 

Maximum Length Sequence: 
20 related to: 
Clone IDs: 
9214 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 404 
25 - Ceres seq_id 2029079 

- Alternative transcription start site(s) located in SEQ 
ID NO 404: 

-4, -1,2, 3, 4, 8, 9, 10, 11, 14, 15, 16, 17, 18, 20, 23, 24 

30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 405 

- Ceres seq_id 2029080 

- Location of start within SEQ ID NO 404: at 3 nt . 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 268 

- gi No. 132825 

40 - Description: 50S RIBOSOMAL PROTEIN CL25, 

CHLOROPLAST PRECURSOR >gi I 71308 | pir M R5PM25 ribosomal protein 
PsCL25 precursor, chloroplast - garden pea 

>gi I 20877 | emb | CAA32187 | (X14022) PsCL25 ribosomal preprotein 
(AA -30 to 74) [Pisum sativum] 
45 - % Identity: 56.1 

- Alignment Length: 82 

- Location of Alignment in SEQ ID NO 405: from 14 

to 95 



50 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 406 
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- Ceres seq__id 2029081 

- Location of start within SEQ ID NO 404: at 39 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 406: at 16 aa . 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 269 
10 - gi No. 132825 

- Description: SOS RIBOSOMAL PROTEIN CL25, 
CHLOROPLAST PRECURSOR >gi | 7 1 3 0 8 | pir | | R5PM2 5 ribosomal protein 
PsCL25 precursor, chloroplast - garden pea 

>gi I 20877 | emb | CAA32187 | (X14022) PsCL25 ribosomal preprotein 
15 (AA -30 to 74) [Pisum sativum] 

- % Identity: 56.1 

- Alignment Length: 32 

- Location of Alignment in SEQ ID NO 406: from 2 to 

83 

20 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 407 

- Ceres seq_id 2029082 

- Location of start within SEQ ID NO 404: at 144 nt . 

25 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 270 
30 - gi No. 132825 

- Description: 50S RIBOSOMAL PROTEIN CL25, 
CHLOROPLAST PRECURSOR >gi | 7 1 3 0 8 | pir | | R5PM25 ribosomal protein 
PsCL25 precursor, chloroplast - garden pea 

>gi I 20877 | emb | CAA32187 | (X14022) PsCL25 ribosomal preprotein 
35 (AA -30 to 74) [Pisum sativum] 

- % Identity: 56.1 

- Alignment Length: 82 

- Location of Alignment in SEQ ID NO 407: from 1 to 

48 

40 

Maximum Length Sequence: 
Public Genomic DNA: 

gi No: 6434227 

Predicted Exons : 



50 



INTR 


50076 .. 


. 50370 


OCKHAMG- 


CDNA 


INTR 


50793 .. 


50874 


OCKHAMG- 


CDNA 


INTR 


51163 .. 


. 51227 


OCKHAMG- 


•CDNA 


INTR 


51331 . . 


. 51417 


OCKHAMG- 


■CDNA 


INTR 


51529 . . 


. 51786 


OCKHAMG- 


•CDNA 


INIT 


50200 . . 


50370 


OCKHAMG- 


•CDS 
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INTR 
INTR 
INTR 
TERM 



50793 
51163 
51331 
51529 



248 
50874 
51227 
51417 
51630 



OCKHAMG-CDS 
OC KHAMG -CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 408 

- Ceres seq id 2029225 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 409 

- Ceres seq_id 2029226 

- Location of start within SEQ ID NO 408: at 125 nt . 

15 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 271 
20 - gi No. 1352051 

- Description: ATP SYNTHASE D CHAIN, MITOCHONDRIAL 
>gi I 484646 | pir | | JS0739 H+-transporting ATP synthase (EC 
3.6.1.34) chain d precursor - rat >gi | 286204 | dbj | BAA02422 I 
(D13120) ATP synthase subunit d precursor [Rattus norvegicus] 

25 - % Identity: 28.6 

- Alignment Length: 145 

- Location of Alignment in SEQ ID NO 409: from 15 

to 154 



30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 410 

- Ceres seq_id 2029227 

- Location of start within SEQ ID NO 408: at 194 nt. 

35 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 272 

- gi No. 1352051 

40 - Description: ATP SYNTHASE D CHAIN, MITOCHONDRIAL 

>gi I 484646 | pir | IJS0739 H+-transporting ATP synthase (EC 
3.6.1.34) chain d precursor - rat >gi | 28 6204 I dbj | BAA02422 I 
(D13120) ATP synthase subunit d precursor [Rattus norvegicus] 

- % Identity: 28.6 

45 - Alignment Length: 145 

- Location of Alignment in SEQ ID NO 410: from 1 to 

131 



50 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6056185 
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Predicted 
INTR 
INTR 

INTR 
INTR 



Exons ; 

21555 

22891 

22502 
22891 



INIT 21706 
TERM 22891 
gi No: 6091703 
Predicted Exons 
INTR 21555 
INTR 22891 



INTR 
INTR 

INIT 
TERM 



22502 
22891 

21706 
22891 



249 

22692 
23339 

22692 
23338 

22692 
23124 



22692 
23339 

22692 
23338 

22692 
23124 



OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDS 
OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 411 

- Ceres seq id 2029422 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 412 

- Ceres seq_id 2029423 

- Location of start within SEQ ID NO 411: at 152 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- CONSTANS family zinc finger 

- Location within SEQ ID NO 412: from 17 to 57 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 273 

- gi No. 1076301 

- Description: CONSTANS protein - Arabidopsis 
thaliana >gi | 1161514 | emb I CAA64407 I (X94937) CONSTANS protein 
[Arabidopsis thaliana] 

- % Identity: 49.2 

- Alignment Length: 63 

- Location of Alignment in SEQ ID NO 412: from 12 



to 74 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 413 

- Ceres seq_id 2029424 

- Location of start within SEQ ID NO 411: at 155 nt , 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- CONSTANS family zinc finger 

- Location within SEQ ID NO 413: from 16 to 56 aa. 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 274 

- gi No. 1076301 

- Description: CONSTANS protein - Arabidopsis 

10 thaliana >gi | 1161514 | emb I CAA64407 | (X94937) CONSTANS protein 
[Arabidopsis thaliana ] 

- % Identity: 49.2 

- Alignment Length: 63 

- Location of Alignment in SEQ ID NO 413: from 11 

15 to 73 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 414 

- Ceres seq_id 2029425 

20 - Location of start within SEQ ID NO 411: at 443 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 275 

- gi No. 1076301 

- Description: CONSTANS protein - Arabidopsis 
thaliana >gi | 1161514 I emb I CAA644 07 [ (X94937) CONSTANS protein 
[Arabidopsis thaliana] 

30 - % Identity: 69.8 

- Alignment Length: 43 

- Location of Alignment in SEQ ID NO 414: from 260 

to 302 

35 Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 5832738 



Predicted 


Exons : 








INTR 


56241 . . 


56150 


OCKHAMG- 


CDNA 


INTR 


56041 . . 


55864 


OCKHAMG- 


CDNA 


INTR 


55493 . . 


55317 


OCKHAMG- 


■CDNA 


INTR 


55216 


55085 


OCKHAMG- 


CDNA 


INTR 


54898 .. 


54803 


OCKHAMG- 


■CDNA 


INTR 


54725 . . 


. 54609 


OCKHAMG- 


■CDNA 


INTR 


54510 . . 


54415 


OCKHAMG- 


•CDNA 


INTR 


54288 . . 


54176 


OCKHAMG- 


•CDNA 


INTR 


54077 .. 


53779 


OCKHAMG- 


•CDNA 


INIT 


56181 .. 


56150 


OCKHAMG- 


•CDS 


INTR 


56041 . . 


55864 


OCKHAMG- 


•CDS 


INTR 


55493 .. 


. 55317 


OCKHAMG- 


■CDS 
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INTR 
INTR 
INTR 
INTR 
INTR 
TERM 



55216 
54898 
54725 
54510 
54288 
54077 



251 
55085 
54803 
54609 
54415 
54176 
53954 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



10 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 415 

- Ceres seq_id 2029806 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 416 
15 - Ceres seq_id 2029807 

- Location of start within SEQ ID NO 415: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
20 - Aminotransferase class IV 

- Location within SEQ ID NO 416: from 68 to 352 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 27 6 
25 - gi No. 3540183 

- Description: (AC004122) Highly Similar to 
branched-chain amino acid aminotransferase [Arabidopsis 
thaliana] 

- % Identity: 53.8 

30 - Alignment Length: 318 

- Location of Alignment in SEQ ID NO 416: from 55 

to 371 

(B) Polypeptide Sequence 
35 - Pat. Appln. SEQ ID NO 417 

- Ceres seq_id 2029808 

- Location of start within SEQ ID NO 415: at 61 nt . 

(C) Nomination and Annotation of Domains within 
40 Predicted Polypeptide ( s ) 

- Aminotransferase class IV 

- Location within SEQ ID NO 417: from 48 to 332 aa, 

(Dp) Related Amino Acid Sequences 
45 - Alignment No. 277 

- gi No. 3540183 

- Description: (AC004122) Highly Similar to 
branched-chain amino acid aminotransferase [Arabidopsis 
thaliana] 

50 - % Identity: 53.8 

- Alignment Length: 318 
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- Location of Alignment in SEQ ID NO 417: from 35 

to 351 

(B) Polypeptide Sequence 
5 - Pat. Appln. SEQ ID NO 418 

- Ceres seq_id 2029809 

- Location of start within SEQ ID NO 415: at 163 nt . 

(C) Nomination and Annotation of Domains within 
10 Predicted Polypeptide ( s ) 

- Aminotransferase class IV 

- Location within SEQ ID NO 418: from 14 to 298 aa . 

(Dp) Related Amino Acid Sequences 
15 - Alignment: No. 278 

- gi No. 3540183 

- Description: (AC004122) Highly Similar to 
branched-chain amino acid aminotransferase [Arabidopsis 
thaliana] 

20 - % Identity: 53.8 

- Alignment Length: 318 

- Location of Alignment in SEQ ID NO 418: from 1 to 

317 

25 Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 5822667 
Predicted Exons : 

INTR 59013 ... 59999 

30 

SINGLE 59071 . . . 59769 
gi No: 6041831 
Predicted Exons: 

INTR 53943 . . . 54929 

35 

SINGLE 54001 • . . 54699 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 419 
40 - Ceres seq_id 2030038 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 420 
45 - Ceres seq_id 2030039 

- Location of start within SEQ ID NO 419: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
50 (Dp) Related Amino Acid Sequences 

- Alignment No. 27 9 



OC KH AMG - C DNA 
OCKHAMG-CDS 

OCKHAMG-CDNA 
OCKHAMG-CDS 
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- gi No. 4009495 

- Description: (AF068904) YlmG [Staphylococcus 

aureus ] 

- % Identity: 31.7 

5 - Alignment Length: 63 

- Location of Alignment in SEQ ID NO 420: from 177 

to 239 



(B) Polypeptide Sequence 
10 - Pat. Appln. SEQ ID NO 421 

- Ceres seq_id 2030040 

- Location of Starr within SEQ ID NO 419: at 59 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 421 : at 19 aa. 

15 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 280 
20 - gi No. 4009495 

- Description: (AF068904) YlmG [Staphylococcus 

aureus ] 

- % Identity: 31.7 

- Alignment Length: 63 

25 - Location of Alignment in SEQ ID NO 421: from 158 

to 220 



30 



Maximum Length Sequence; 

Public Genomic DNA: 
gi No: 4539290 
Predicted Exons : 





INIT 


6134 


6132 


GEN BANK 




INTR 


6021 . . 


5973 


GENBANK 




INTR 


5588 . . 


. 5309 


GENBANK 


35 


INTR 


5226 . . 


. 5106 


GENBANK 




TERM 


4951 


. 4871 


GENBANK 




INTR 


6208 . . 


6132 


OCKHAMG-CDNA 




INTR 


6021 . . 


5973 


OCKHAMG-CDNA 


40 


INTR 


5588 


. 5309 


OCKHAMG-CDNA 




INTR 


5226 . . 


. 5106 


OCKHAMG-CDNA 




INTR 


4951 . . 


. 4724 


OCKHAMG-CDNA 




INIT 


6134 


6132 


OCKHAMG-CDS 


45 


INTR 


6021 . . 


5973 


OCKHAMG-CDS 




INTR 


5588 . . 


5309 


OCKHAMG-CDS 




INTR 


5226 . . 


. 5106 


OCKHAMG-CDS 




TERM 


4951 . . 


. 4871 


OCKHAMG-CDS 




gi No: 491 


4454 






50 


Predicted 


Exons : 








INTR 


39557 . . 


. 39481 


OCKHAMG-CDNA 
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INTR 39370 . . . 39322 
INTR 38937 . . . 38658 
INTR 38575 . . . 38455 
INTR 38300 . . . 38073 

5 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 422 

- Ceres seq_id 2031778 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 423 

- Ceres seq_id 2031779 

- Location of start within SEQ ID NO 422: at 3 nt . 

15 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 281 
20 - gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 53.1 

- Alignment Length: 164 

25 - Location of Alignment in SEQ ID NO 423: from 25 

to 186 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 424 
30 - Ceres seq_id 2031780 

- Location of start within SEQ ID NO 422: at 75 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
35 (Dp) Related Amino Acid Sequences 

- Alignment No. 282 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

40 - % Identity: 53.1 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 424: from 1 to 

162 

45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 425 

- Ceres seq_id 2031781 

- Location of start within SEQ ID NO 422: at 210 nt . 

50 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
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15 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 283 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 53.1 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 425: from 1 to 

117 

Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6434226 
Predicted Exons : 

INTR 17873 . . . 17514 OCKHAMG-CDNA 
INTR 17046 ... 16869 OCKHAMG-CDNA 



INIT 17854 . . . 17514 OCKHAMG-CDS 
TERM 17046 ... 17028 OCKHAMG-CDS 



20 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 426 

- Ceres seq__id 2032723 



25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 427 

- Ceres seq_id 2032724 

- Location of start within SEQ ID NO 426: at 20 nt . 
30 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 427 : at 24 aa . 



(C) Nomination and Annoiation of Domains within 
Predicted Polypeptide ( s ) 
35 - Plant lipid transfer protein family 

- Location within SEQ ID NO 427: from 28 to 115 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 284 
40 - gi No. 899224 

- Description: (X60318) E2 [Brassica napus] 
>gi I 384340 | prf | I1905428A phospholipid transfer protein 
[Brassica napus] 

- % Identity: 79 

45 - Alignment Length: 119 

- Location of Alignment in SEQ ID NO 427: from 1 to 

119 



(B) Polypeptide Sequence 
50 - Pat. Appln. SEQ ID NO 428 

- Ceres seq_id 2032725 
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- Location of start within SEQ ID NO 426: at 71 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 428: from 11 to 98 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 285 

- gi No. 899224 

- Description: (X60318) E2 [Brassica napus] 
>gi I 38434C | prf | | 1905428A phospholipid transfer protein 
[Brassica napus] 

- % Identity: 79 

- Alignment Length: 119 

- Location of Alignment in SEQ ID NO 428: from 1 to 

102 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 42 9 

- Ceres seq_id 2032726 

- Location of start within SEQ ID NO 426: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6143856 
Predicted Exons: 

INIT 30571 . . . 30464 

TERM 30327 . . . 30238 
gi No: 6223633 
Predicted Exons: 

INIT 30571 . . . 30464 

TERM 30327 . . . 30238 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 430 

- Ceres seq_id 2035536 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 431 

- Ceres seq_id 2035537 

- Location of start within SEQ ID NO 430: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 431: at 20 aa . 



OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
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- (C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 28 6 

- gi No. 3024192 

- Description: NADH-UBIQUINONE OXIDOREDUCTASE MWFE 
SUBUNIT (COMPLEX I-MWFE) (CI-MWFE) >gi I 227 5037 | emb | CAA68 977 | 
(Y07708) NADH dehydrogenase [Mus musculus] 

. - % Identity: 31.5 

- Alignment Length: 54 

- Location of Alignment in SEQ ID NO 431: from 4 to 

56 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 432 

- Ceres seq_id 2035538 

- Location of start within SEQ ID NO 430: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 433 

- Ceres seq_id 2035539 

- Location of start within SEQ ID NO 430: at 25 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 433: at 17 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 287 

- gi No. 3024192 

- Description: NADH-UBIQUINONE OXIDOREDUCTASE MWFE 
SUBUNIT (COMPLEX I-MWFE) (CI-MWFE ) >gi | 22 7 5 0 37 | emb I CAA68 97 7 | 
(Y07708) NADH dehydrogenase [Mus musculus] 

- % Identity: 31.5 

- Alignment Length: 54 

- Location of Alignment in SEQ ID NO 433: from 1 to 

48 

Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6136357 
Predicted Exons : 

INIT 51922 . . . 51830 OCKHAMG-CDS 
INTR 51129 ... 51055 OCKHAMG-CDS 
INTR 50958 . . . 50806 OCKHAMG-CDS 
INTR 50706 ... 50608 OCKHAMG-CDS 
TERM 50326 ... 50243 OCKHAMG-CDS 
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10 



gi No: 6143858 








Predicted 


Exons : 








INIT 


51922 . . 


jlO jU 






INTR 


51129 .. 




f~\ (~* \f TT 7\ MP 




INTR 


50958 . . 


c; n q n c 




-CDb 


INTR 


50706 . . 


jUdUo 




-UDb 


TERM 


50326 . . 


50243 


OCKHAMG 


-CDS 


gi No: 6449510 








Predicted 


Exons : 








INTR 


9035 


9111 


OCKHAMG 


-CDNA 


INTR 


9208 


9360 


OCKHAMG- 


-CDNA 


INTR 


9460 


9558 


OCKHAMG- 


-CDNA 


INTR 


9840 . . 


. 9992 


OCKHAMG- 


-CDNA 



15 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 434 

- Ceres seq_id 2035575 



20 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 435 

- Ceres seq_id 2035576 

- Location of start within SEQ ID NO 434: at 1 nt. 

25 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 288 

- gi No. 6066609 

30 - Description: (AJ009987) chloroplast channel 

forming outer membrane protein [Pisum sativum] 

- % Identity: 54.2 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 435: from 1 to 

35 167 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 436 

- Ceres seq_id 2035577 

40 - Location of start within SEQ ID NO 434: at 13 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
45 - Alignment No. 289 

- gi No. 6066609 

- Description: (AJ009987) chloroplast channel 
forming outer membrane protein [Pisum sativum] 

- % Identity: 54.2 

50 - Alignment Length: 177 
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- Location of Alignment in SEQ ID NO 436: from 1 to 

163 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 437 

- Ceres seq__id 2035578 

- Location of start within SEQ ID NO 434: at 40 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 290 

- gi No. 6066609 

- Description: (AJ009987) chloroplast channel 
forming outer membrane protein [Pisum sativum] 

- % Identity: 54.2 

- Alignment Length: 177 

- Location of Alignment in SEQ ID NO 437: from 1 to 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6434213 
Predicted Exons : 



INIT 
INTR 
INTR 
INTR 
TERM 



54056 
54178 
54401 
54807 
55079 



54 085 OCKHAMG-CDS 

5 4 325 OCKHAMG-CDS 

5 4 572 OCKHAMG-CDS 

54993 OCKHAMG-CDS 

55291 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 438 

- Ceres seq_id 2036457 



( B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 439 

- Ceres seq_id 2036458 

- Location of start within SEQ ID NO 438: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 439: from 104 to 188 

aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 291 

- gi No. 133940 

- Description: 40S RIBOSOMAL PROTEIN S3A (S1A) 

>gi I 70851 | pir | | R3XL3A ribosomal protein S3a - African clawed 
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(X57322) 



ribosomal protein Sla 



frog >gi | 650 91 | emb | CAA4 05 92 | 
[Xenopus laevis] 

- % Identity: 75.1 

- Alignment Length: 250 

- Location of Alignment in SEQ ID NO 439: from 1 to 

249 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 440 

- Ceres seq_id 2036459 

- Location of start within SEQ ID NO 438 



at 130 nt 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 440: from 61 to 145 aa 



>gi | 708 
frog >g 
[Xenopu 



206 



Dp) Related Amino Acid Sequences 

- Alignment No. 292 

- gi No. 133940 

- Description: 40S RIBOSOMAL PROTEIN S3A (SlA) 

51 | pir | [ R3XL3A ribosomal protein S3a - African clawed 
i I 65091 | emb I CAA40592 | (X57322) ribosomal protein Sla 
s laevis] 

- % Identity: 75.1 



- Alignment Length: 250 

- Location of Alignment 



in SEQ ID NO 440: from 1 to 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6434218 
Predicted Exons : 



INIT 
INTR 
INTR 
INTR 
INTR 
TERM 



85292 
84903 
84730 
84524 
84282 
84148 



85282 
84828 
84629 
84376 
84156 
84065 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 441 

- Ceres seq__id 2036585 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 442 

- Ceres seq_id 2036586 

- Location of start within SEQ ID NO 441: at 1 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L24e 

- Location within SEQ ID NO 442: from 3 to 73 aa. 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 293 

- gi No. 6094040 

- Description: 60S RIBOSOMAL PROTEIN L24 

10 >gi | 2961300 I emb | CAA12358 | (AJ225027) ribosomal protein L24 
[Cicer arietinum] 

- % Identity: 86.6 

- Alignment Length: 158 

- Location of Alignment in SEQ ID NO 442: from 1 to 

15 156 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 443 

- Ceres seq_id 2036587 

20 - Location of start within SEQ ID NO 441: at 166 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 294 

- gi No. 6094040 

- Description: 60S RIBOSOMAL PROTEIN L24 

>gi I 2961300 | emb | CAA12358 | (AJ225027) ribosomal protein L24 
[Cicer arietinum] 
30 - % Identity: 86.6 

- Alignment Length: 158 

- Location of Alignment in SEQ ID NO 443: from 1 to 

101 

35 Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 5668776 
Predicted Exons : 

INIT 19552 ... 19704 OCKHAMG-CDS 
40 TERM 19947 ... 20279 OCKHAMG-CDS 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 4 44 

- Ceres seq__id 2039554 

4 5 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 445 

- Ceres seq_id 2039555 

50 - Location of start within SEQ ID NO 444: at 1 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 295 

- gi No. 5882720 

- Description: (AC008263) Similar to gb|D86180 
phosphoribosylanthranilate transferase from Pisum sativum and 
contains 2 PF| 00168 C2 (phospholipid binding) domains. ESTs 
gb|H76726, gb|T45544 and gb|N96377 come from this gene. 
[Arab . . . 

- % Identity: 99.3 

- Alignment Length: 151 

- Location of Alignment in SEQ ID NO 445: from 1 to 

151 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 46 

- Ceres seq_id 2039556 

- Location of start within SEQ ID NO 444: ai 136 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 296 

- gi No. 5882720 

- Description: (AC008263) Similar to gb|D86180 
phosphoribosylanthranilate transferase from Pisum sativum and 
contains 2 PF" 00168 C2 (phospholipid binding) domains. ESTs 
gb|H76726, gb|T45544 and gb|N96377 come from this gene. 
[Arab . . . 

- % Identity: 99.3 

- Alignment Length: 151 

- Location of Alignment in SEQ ID NO 446: from 1 to 

106 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4263586 
Predicted Exons : 



INIT 
INTR 
INTR 
INTR 
TERM 



44660 
44822 
45066 
45171 
45352 



4 4 729 OCKHAMG-CDS 

4 4 985 OCKHAMG-CDS 

4 5099 OCKHAMG-CDS 

4 52 64 OCKHAMG-CDS 

45367 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 447 

- Ceres seq_id 2044283 



(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 44 8 

- Ceres seq_id 2044284 

- Location of start within SEQ ID NO 447: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
5 ID NO 448: at 19 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
10 - Alignment No. 297 

- gi No. 4512613 

- Description: (AC004793) F28K20.12 [Arabidopsis 

thaliana] 

- % Identity: 99.2 

15 - Alignment Length: 122 

- Location of Alignment in SEQ ID NO 448: from 1 to 

122 

(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 449 

- Ceres seq_id 2044285 

- Location of start within SEQ ID NO 447: at 13 nt . 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 298 

- gi No. 4512613 

- Description: (AC004793) F28K20.12 [Arabidopsis 

30 thaliana] 

- % Identity: 99.2 

- Alignment Length: 122 

- Location of Alignment in SEQ ID NO 449: from 1 to 

118 



35 



40 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 450 

- Ceres seq_id 2044286 

- Location of start within SEQ ID NO 447 : at 16 nt , 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 299 
45 - gi No. 4512613 

- Description: (AC004793) F28K20.12 [Arabidopsis 

thaliana] 

- % Identity: 99.2 

- Alignment Length: 122 

50 - Location of Alignment in SEQ ID NO 450: from 1 to 

117 
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Maximum Length Sequence: 
related to: 

Clone IDs: 

157859 

Public Genomic DNA: 
gi No: 4519187 
Predicted Exons : 





INIT 


64212 . . 


S A "3 1 ft 




10 


INTR 


64435 . . 


64599 


OCKHAMG-CDS 




INTR 


64655 . . 


64789 


OCKHAMG-CDS 




INTR 


64869 . . 


64952 


OCKHAMG-CDS 




INTR 


65028 . . 


. 65143 


OCKHAMG-CDS 




INTR 


65245 . . 


65346 


OCKHAMG-CDS 


15 


INTR 


65417 . . 


. 65530 


OCKHAMG-CDS 




INTR 


65621 . . . 


65677 


OCKHAMG-CDS 




INTR 


65790 . . 


65856 


OCKHAMG-CDS 




INTR 


65929 . . 


65982 


OCKHAMG-CDS 




INTR 


66089 . . 


66163 


OCKHAMG-CDS 


20 


INTR 


66558 . . 


66593 


OCKHAMG-CDS 




INTR 


66699 . . 


66811 


OCKHAMG-CDS 




TERM 


68086 . . 


68098 


OCKHAMG-CDS 




(Ac) cDNA Polynucleotide 


Sequence 





- Pat. Appln. SEQ ID NO 451 
25 - Ceres seq__id 2048114 

- Alternative transcription start site(s) located in SEQ 
ID NO 451: 

456 

- Clone 157859 starts at 456 and ends at in cDNA. 

30 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 452 

- Ceres seq_id 2048115 

- Location of start within SEQ ID NO 451: at 1 nt . 

35 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 452: at 34 aa . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
40 (Dp) Related Amino Acid Sequences 

- Alignment No. 300 

- gi No. 4850384 

- Description: (AC007357) F3F19.3 [Arabidopsis 

thaliana] 

45 - % Identity: 66.2 

- Alignment Length: 201 

- Location of Alignment in SEQ ID NO 452: from 95 

to 295 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 453 
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- Ceres seq_id 2048116 

- Location of start within SEQ ID NO 451: at 19 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 453: at 28 aa. 

5 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 301 
10 - gi No. 4850384 

- Description: (AC007357) F3F19.3 [Arabidopsis 

thaliana] 

- % Identity: 66.2 

- Alignment Length: 201 

15 - Location of Alignment in SEQ ID NO 453: from 89 

to 289 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 454 
20 - Ceres seq_id 2048117 

- Location of start within SEQ ID NO 451: at 22 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 454 : at 27 aa. 

25 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 302 

- gi No. 4850384 

30 - Description: (AC007357) F3F19.3 [Arabidopsis 

thaliana] 

- % Identity: 66.2 

- Alignment Length: 201 

- Location of Alignment in SEQ ID NO 454: from 88 

35 to 288 

Maximum Length Sequence: 
Public Genomic DNA: 





gi No: 4519194 






40 


Predicted 


Exons : 








INTR 


15272 .. 


. 15482 


OCKHAMG-CDNA 




INTR 


15570 .. 


15615 


OCKHAMG-CDNA 




INTR 


16443 .. 


16834 


OCKHAMG-CDNA 


45 


INIT 


15350 . . 


15482 


OCKHAMG-CDS 




INTR 


15570 .. 


15615 


OCKHAMG-CDS 




TERM 


16443 .. 


16671 


OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 
50 - Pat. Appln. SEQ ID NO 455 

- Ceres seq__id 2048271 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 456 
5 - Ceres seq_id 2048272 

- Location of start within SEQ ID NO 455: at 79 nt. 

(C) Nomination and Annotation of Domains within 
Predicted Polypepticl3 (s) 
10 - Ribosomal protein S16 

- Location within SEQ ID NO 456: from 9 to 70 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 303 
15 - gi No. 3861401 

- Description: (AJ235273) 30S RIBOSOMAL PROTEIN S16 
(rpsP) [Rickettsia prowazekii] 

- % Identity: 43.7 

- Alignment Length: 103 

20 - Location of Alignment in SEQ ID NO 456: from 1 to 

102 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 457 
25 - Ceres seq_id 2048273 

- Location of start within SEQ ID NO 455: at 139 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
30 - Ribosomal protein S16 

- Location within SEQ ID NO 457: from 1 to 50 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 304 
35 - gi No. 3861401 

- Description: (AJ235273) 30S RIBOSOMAL PROTEIN S16 
(rpsP) [Rickettsia prowazekii] 

- % Identity: 43.7 

- Alignment Length: 103 

40 - Location of Alignment in SEQ ID NO 457: from 1 to 

82 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 458 
45 - Ceres seq_id 2048274 

- Location of start within SEQ ID NO 455: at 232 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
50 (Dp) Related Amino Acid Sequences 

- Alignment No. 305 
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- gi No. 3861401 

- Description: (AJ235273) 30S RIBOSOMAL PROTEIN S16 
(rpsP) [Rickettsia prowazekii] 

- % Identity: 43.7 

- Alignment Length: 103 

- Location of Alignment in SEQ ID NO 458: from 1 to 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4519195 
Predicted Exons: 



INIT 
INTR 
INTR 
INTR 
TERM 



48116 
48033 
47604 
47435 
47286 



48096 . OCKHAMG-CDS 

47989 OCKHAMG-CDS 

47531 OCKHAMG-CDS 

4738 6 OCKHAMG-CDS 

47192 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 459 

- Ceres seq_id 2048331 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 460 

- Ceres seq_id 2048332 

- Location of start within 



SEQ ID NO 459: at 1 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Domain of unknown function 

- Location within SEQ ID NO 460: from 8 to 92 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 306 

- gi No. 5032215 

- Description: ref | NP_005827 . 1 | pUKll 4 | 
translat ional inhibitor protein pl4.5 

>gi I 1717975 | sp I P52758 | UK14_HUMAN 14.5 KD 
INHIBITOR PROTEIN (P14.5) (UK114 ANTIGEN 
>gi | 117 7 4 35 | emb I CAA64670 I sapiens] 

- % Identity: 58.8 

- Alignment Length: 85 

- Location of Alignment in SEQ 

92 



TRANSLAT IONAL 
HOMOLOG) 



ID NO 460: from 8 to 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 61 

- Ceres seq_id 2048333 

- Location of start within SEQ ID NO 459: at 79 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Domain of unknown function 

- Location within SEQ ID NO 461: from 1 to 66 aa. 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 307 

- gi No. 5032215 

- Description: ref | NP_005827 . 1 | pUK114 | 
10 translational inhibitor protein pl4.5 

>gi I 1717 975 | sp | P52758 | UK14_HUMAN 14.5 KD TRANSLATIONAL 
INHIBITOR PROTEIN (P14.5) (UK114 ANTIGEN HOMOLOG) 
>gi 1117 7 4 351 emb | CAA64670 | sapiens] 

- % Identity: 58.8 

15 - Alignment Length: 85 

- Location of Alignment in SEQ ID NO 461: from 1 to 

66 

(3) Polypeptide Sequence 
20 - Pan. Appln. SEQ ID NO 4 62 

- Ceres seq_id 2048334 

- Location of start within SEQ ID NO 459: at 139 nt . 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypept ide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 308 

- gi No. 5032215 

- Description: ref | NP_0 05827 . 1 | pUKl 1 4 | 
30 translational inhibitor protein pl4.5 

>gi I 1717975 | sp | P52758 I UK14_HUMAN 14.5 KD TRANSLATIONAL 
INHIBITOR PROTEIN (P14.5) (UK114 ANTIGEN HOMOLOG) 
>gi 11177 4 35 1 emb | CAA64 67 0 | sapiens] 

- % Identity: 58.8 

35 - Alignment Length: 85 

- Location of Alignment in SEQ ID NO 462: from 1 to 

46 

Maximum Length Sequence: 
40 Public Genomic DNA: 
gi No: 4589412 
Predicted Exons : 



INIT 


37105 . . 


37091 


OCKHAMG 


-CDS 


INTR 


36413 . . 


. 36339 


OCKHAMG- 


-CDS 


INTR 


36224 . . 


36176 


OCKHAMG- 


-CDS 


INTR 


35977 . . 


35698 


OCKHAMG- 


-CDS 


INTR 


35603 . . 


35471 


OCKHAMG- 


-CDS 


TERM 


35193 .. 


. 35119 


OCKHAMG 


-CDS 



50 (Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 463 
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- Ceres seq_id 2048466 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 464 

- Ceres seq_id 2048467 

- Location of start within SEQ ID NO 463: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 30 9 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 50.6 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 464: from 30 

to 193 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 65 

- Ceres seq_id 2048468 

- Location of start within SEQ ID NO 463: at 4 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 310 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 50.6 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 465: from 29 

to 192 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 466 

- Ceres seq_id 2048469 

- Location of start within SEQ ID NO 463: at 88 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 311 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 50.6 

- Alignment Length: 164 
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- Location of Alignment in SEQ ID NO 466: from 1 to 



164 



10 



Maximum Lengrh Sequence: 
Public Genomic DNA: 
gi No: 4914400 
Predicted Exons : 
INTR 97410 

SINGLE 97525 . , 
gi No: 4938473 
Predicted Exons: 

SINGLE 9769 



97 8 3 3 0 C KHAMG - C DNA 
977 97 OCKHAMG-CDS 



10041 



GENBANK 



15 



INTR 
INTR 



9654 
10721 



10 07 8 OC KHAMG -C DNA 
10 8 8 0 OC KHAMG - C DNA 



SINGLE 9769 



10041 



OCKHAMG-CDS 



20 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 467 

- Ceres seq id 2050485 



25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 468 

- Ceres seq_id 2050486 

- Location of start within SEQ ID NO 467: at 2 nt . 

30 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 312 

- gi No. 2493089 

35 - Description: ATP SYNTHASE F CHAIN, MITOCHONDRIAL 

- % Identity: 29.7 

- Alignment Length: 77 

- Location of Alignment in SEQ ID NO 4 68: from 52 

to 125 

40 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 469 

- Ceres seq_id 2050487 

- Location of start within SEQ ID NO 467: at 116 nt . 

45 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 313 
50 - gi No. 2493089 

- Description: ATP SYNTHASE F CHAIN, MITOCHONDRIAL 
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- % Identity: 


29.7 






- Alignment Length: 77 






- Location of 


Alignment 


in SEQ ID NO 


5 


to 87 








Maximum Length Sequence 








Public Genomic DNA: 








gi No: 4454022 








Predicted Exons : 






1 0 


INIT 55210 . 


55024 


GENBANK 




TERM 54477 . 


. . 54209 


GEN BANK 




INTR 55270 . 


55024 


OC KH AMG - C DNA 




INTR 54477 . 


. . 54299 


OC KH AMG - C DN A 


15 










INIT 55210 . 


55024 


OCKHAMG-CDS 




TERM 54477 . 


54209 


OCKHAMG-CDS 




(Ac) cDNA Polynucleotide Sequence 




20 


- Pat. Appln. SEQ 


ID NO 470 





from 14 



- Ceres seq_id 2050708 



25 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 471 

- Ceres seq_id 2050709 

- Location of start within 



SEQ ID NO 470: at 61 nt . 



(C) Nomination and Annotation of Domains within 
30 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 314 

- gi No. 2129641 

- Description: major latex protein type 1 - 

35 Arabidopsis thaliana >gi I 1107493 I emb I CAA63026 I (X91960) major 
latex protein typel [Arabidopsis thaliana] 

- % Identity: 71.3 

- Alignment Length: 150 

- Location of Alignment in SEQ ID NO 471: from 1 to 

40 150 



45 



50 



Maximum Length Sequence; 

Public Genomic DNA: 
gi No: 4914454 
Predicted Exons: 



INTR 
INTR 
INTR 
INTR 
INTR 
INTR 



42446 
40358 
39551 
39415 
39280 
38937 



414 56 OCKHAMG-CDS 

4 0280 OCKHAMG-CDS 

3 9481 OCKHAMG-CDS 

39322 OCKHAMG-CDS 

3 92 68 OCKHAMG-CDS 

38658 OCKHAMG-CDS 
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INTR 38575 . . . 38455 OCKHAMG-CDS 
TERM 38300 . . . 38220 OCKHAMG-CDS 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 472 

- Ceres seq__id 2050901 



(B) Polypeptide Sequence 

- Pat. £ppln. SEQ ID NO 473 

- Ceres seq_id 2050902 

- Location of start within SEQ ID NO 472: at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 315 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 53.1 

- Alignment Length: 147 

- Location of Alignment in SEQ ID NO 473: from 416 

to 560 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 474 

- Ceres seq_id 2050903 

- Location of start within SEQ ID NO 472: at 123 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 316 

- gi No. 3688432 

- Description: (AJ011705) 40S ribosomal protein S10 
[Lumbricus rubellus ] 

- % Identity: 53.1 

- Alignment Length: 147 

- Location of Alignment in SEQ ID NO 474: from 376 

to 520 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 475 

- Ceres seq_id 2050904 

- Location of start within SEQ ID NO 472: at 678 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 475: at 39 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 317 

- gi No. 3688432 

- Description. 1 (AJ011705) 40S ribosomal protein S10 
[Lumbr icus rube 11 us ] 

- % Identity: 53.1 

- Alignment Length: 14 7 

- Location of Alignment in SEQ ID NO 475: from 191 

to 335 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4725940 
Predicted Exons : 
SINGLE 96089 . , 



INTR 



96156 



SINGLE 96083 
gi No: 5823567 
Predicted Exons: 

SINGLE 96089 

INTR 96156 
SINGLE 96083 



95736 GENBANK 

9 5 622 OCKHAMG-CDNA 

95736 OCKHAMG-CDS 

9 5736 GENBANK 

9 5 622 OCKHAMG-CDNA 

95736 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 476 

- Ceres seq_id 2051325 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 477 

- Ceres seq_id 2051326 

- Location of start within SEQ ID NO 476: at 68 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 477 : at 30 aa. 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 477: from 34 to 115 aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 318 

- gi No. 4725952 

- Description: (AL049730) AIRlA-like protein 
[Arabidopsis t ha liana] 

- % Identity: 100 

- Alignment Length: 117 
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- Location of Alignment in SEQ ID NO 477: from i to 

117 

(B) Polypeptide Sequence 
5 - Pat. Appln. SEQ ID NO 478 

- Ceres seq_id 2051327 

- Location of start within SEQ ID NO 476: at 74 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 478.: ai 28 aa. 

10 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

- Plant lipid transfer protein family 

- Location within SEQ ID NO 478: from 32 to 113 aa. 

15 

(Dp) Related Amino Acid Sequences 

- Alignment No. 319 

- gi No. 4725952 

- Description: (AL049730) AIRlA-like protein 
20 [Arabidopsis thaliana] 

- % Identity: 100 

- Alignment Length: 117 

- Location of Alignment in SEQ ID NO 478: from 1 to 

115 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 479 

- Ceres seq_id 2051328 

- Location of start within SEQ ID NO 476: at 288 nt . 
30 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 479: at 61 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
35 (Dp) Related Amino Acid Sequences 

Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4490717 
40 Predicted Exons : 



45 



INIT 


87492 


87378 


GENBANK 




INTR 


87034 .. 


86928 


GENBANK 




INTR 


86683 .. 


. 86641 


GENBANK 




TERM 


86445 .. 


. 86402 


GENBANK 




INTR 


87555 . . 


87378 


OCKHAMG- 


CDNA 


INTR 


87034 . . 


. 86928 


OCKHAMG- 


CDNA 


INTR 


86683 


. 86641 


OCKHAMG- 


•CDNA 


INTR 


86445 .. 


86257 


OCKHAMG- 


•CDNA 


INIT 


87492 .. 


87378 


OCKHAMG- 


•CDS 
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INTR 87034 . . . 
INTR 86683 . . . 
TERM 8 6445 ... 

5 (Ac) cDNA Polynucleotide 

- Pat. Appln. SEQ ID 

- Ceres seq_id 20516 
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8 6928 OCKHAMG-CDS 

86641 OCKHAMG-CDS 

8 64 02 OCKHAMG-CDS 

equence 
NO 480 
3 



10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 481 

- Ceres seq_id 2051634 

- Location of start within SEQ ID NO 480: at 1 nt . 

15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 320 

- gi No. 1083282 

20 - Description: cytochrome-c oxidase (EC 1.9.3.1) - 

mouse >gi | 567766 (L06465) cytochrome c oxidase [Mus musculus] 
>gi I 1094404 | prf M 2106151A cytochrome c oxidase : SUBUNIT=VIa 
[Mus musculus] 

- % Identity: 41.4 

25 - Alignment Length: 106 

- Location of Alignment in SEQ ID NO 481: from 22 

to 120 

(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 482 

- Ceres seq_id 2051635 

- Location of start within SEQ ID NO 480: at 64 nt . 

(C) Nomination and Annotation of Domains within 
35 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 321 

- gi No. 1083282 

- Description: cytochrome-c oxidase (EC 1.9.3.1) - 
40 mouse >gi 1567766 (L06465) cytochrome c oxidase [Mus musculus] 

>gi I 1094404 | prf M 2106151A cytochrome c oxidase : SUBUNIT-VIa 
[Mus musculus] 

- % Identity: 41.4 

- Alignment Length: 106 

45 - Location of Alignment in SEQ ID NO 482: from 1 to 

99 

Maximum Length Sequence: 
Public Genomic DNA: 
50 gi No: 4584531 

Predicted Exons : 
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INTR 56873 ... 56641 OCKHAMG-CDNA 
INTR 5 6627 ... 56451 OCKHAMG-CDNA 

SINGLE 56812 ... 56645 OCKHAMG-CDS 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 483 

- Ceres seq_id 2051906 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 484 

- Ceres seq_id 2051907 

- Location of start within SEQ ID NO 483: at 2 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 322 

- gi No. 4850407 

- Description: (AC007357) EST gb|T21221 comes from 
this gene. [Arabidopsis thaliana] 

- % Identity: 57.1 

- Alignment Length: 35 

- Location of Alignment in SEQ ID NO 484: from 37 

to 70 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 485 

- Ceres seq_id 2051908 

- Location of start within SEQ ID NO 483: at 62 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 323 

- gi No. 4850407 

- Description: (AC007357) EST gb|T21221 comes from 
this gene. [Arabidopsis thaliana] 

- % Identity: 57.1 

- Alignment Length: 35 

- Location of Alignment in SEQ ID NO 485: from 17 

to 50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 86 

- Ceres seq__id 2051909 

- Location of start within SEQ ID NO 483: at 74 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 324 

- gi No. 4850407 

- Description: (AC007357) EST gb|T21221 comes from 
5 this gene. [Arabidopsis thaliana] 

- % Identity: 57.1 

- Alignment Length: 35 

- Location of Alignment in SEQ ID NO 486: from 13 

to 46 

10 

Maximum Length Sequence: 
Public Genomic DNA: 

gi No: 4584841 

Predicted Exons : 



20 



INTR 


75983 . . 


. 75751 


OCKHAMG- 


CDNA 


INTR 


75412 


. 74936 


OCKHAMG- 


•CDNA 


INTR 


75992 . . 


. 75751 


OCKHAMG- 


CDNA 


INTR 


75412 .. 


. 75090 


OCKHAMG- 


CDNA 


INIT 


75937 .. 


. 75751 


OCKHAMG- 


•CDS 


TERM 


75412 . . 


. 75132 


OCKHAMG- 


•CDS 



(Ac) cDNA Polynucleotide Sequence 
25 - Pat. Appln. SEQ ID NO 487 

- Ceres seq_id 2052403 



(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 488 

- Ceres seq_id 2052404 

- Location of start within SEQ ID NO 487: at 56 nt. 

(C) Nomination and Annotation of Domains within 
35 Predicted Polypeptide ( s ) 

- Pathogenesis-related protein Bet v I family 

- Location within SEQ ID NO 488: from 5 to 155 aa. 

(Dp) Related Amino Acid Sequences 
40 - Alignment No. 325 

- gi No. 1321731 

- Description: (Z72439) major allergen Cor a 1 
[Corylus a ve liana] 

- % Identity: 36.3 

45 - Alignment Length: 159 

- Location of Alignment in SEQ ID NO 488: from 5 to 

155 



50 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4699904 
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Predicted 


Exons : 








T "M T 1 "D 
± IN 1 K 


oZOlo 


Q O O A C 

O Z O 4 3 




L, JJlNiA 


11M1 K 


p o 9 o £ 


p ^5 O A 1 


KJ ^ A. rl rilYl 


C HM ft 
L, JJiN-rt 


_L IN 1 K 


O J JO^ • • 


P 7 R £ 1 


^vj ^ i\ n. i-iixi — 


L/ DIN/A. 


INTR 


8 412 4 . . 


84 Job 


OCKHAIxIG- 


CDNA 


INIT 


82730 . . 


82845 


OCKHAMG- 


•CDS 


INTR 


83226 . . 


. 83301 


OCKHAMG- 


CDS 


INTR 


83389 


. 83561 


OCKHAMG- 


CDS 


TERM 


84124 . . 


. 84199 


OCKHAMG- 


•CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 489 

- Ceres seq_id 2053545 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 490 

- Ceres seq_id 2053546 

- Location of start within SEQ ID NO 489: at 113 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 490: at 36 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- MAPEG family 

- Location within SEQ ID NO 490: from 59 to 136 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 326 

- gi No. 4758714 

- Description: ref I NP_00451 9 . 1 | pMGST3 I microsomal 
glutathione S-transf erase 3 >gi 12583081 (AF026977) microsomal 
glutathione S-transf erase 3 [Homo sapiens] 

- % Identity: 41.9 

- Alignment Length: 131 

- Location of Alignment in SEQ ID NO 490: from 8 to 

136 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 491 

- Ceres seq_id 2053547 

- Location of start within SEQ ID NO 489: at 197 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- MAPEG family 

- Location within SEQ ID NO 491: from 31 to 108 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 327 
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- gi No. 4758714 

- Description: ref | NP_004519 . 1 | pMGST3 | microsomal 
glutathione S-transf erase 3 >gi 12583081 (AF026977) microsomal 
glutathione S-transf erase 3 [Homo sapiens] 

5 - % Identity: 41.9 

- Alignment Length: 131 

- Location of Alignment in SEQ ID NO 491: from 1 to 

108 

10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 92 

- Ceres seq_id 2053548 

- Location of start within SEQ ID NO 489: at 329 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
15 ID NO 492: at 22 aa . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- MAPEG family 

20 - Location within SEQ ID NO 492: from 1 to 64 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 328 

- gi No. 4758714 

25 - Description: ref | NP_004519 . 1 I pMGST3 | microsomal 

glutathione S-transf erase 3 >gi 12583081 (AF026977) microsomal 
glutathione S-transf erase 3 [Homo sapiens] 

- % Identity: 41.9 

- Alignment Length: 131 

30 - Location of Alignment in SEQ ID NO 492: from 1 to 

64 

Maximum Length Sequence: 
related to: 
35 Clone IDs: 
2004 
31675 
266907 
Public Genomic DNA: 
40 gi No: 4887737 

Predicted Exons : 



INTR 


7907 


8579 


OCKHAMG- 


■CDNA 


INTR 


7913 . . 


. 8587 


OCKHAMG- 


•CDNA 


INTR 


7911 


8579 


OCKHAMG- 


•CDNA 


INTR 


7909 . 


. 8579 


OCKHAMG- 


•CDNA 


INTR 


7913 . 


8578 


OCKHAMG- 


•CDNA 
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INTR 


7913 


8509 


OCKHAMG-CDNA 




SINGLE 


7958 ... 


8425 


OCKHAMG-CDS 




gi No: 5019264 






5 


Predicted 


Exons : 








SINGLE 


4143 


4610 


GEN BANK 




INTR 


4 0 92 ... 


4 7 64 


OCKHAMG-CDNA 


10 


INTR 


4098 


4772 


OCKHAMG-CDNA 




INTR 


4096 . . . 


4764 


OCKHAMG-CDNA 




INTR 


4094 


4764 


OCKHAMG-CDNA 


15 












INTR 


4098 


4763 


OCKHAMG-CDNA 




INTR 


4 0 98 


4694 


OCKHAMG-CDNA 


20 


SINGLE 


4143 . . . 


4610 


OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 493 

- Ceres seq_id 2053884 

- Alternative transcription start site(s) located in SEQ 
25 ID NO 493: 

-4 7, -3, 2, 3, 4,5, 6, 7, 8, 10, 11, 12, 13 ,14, 15, 16, 20, 21, 28, 29 
38, 44, 45, 60, 68 

- Clone 2004 starts at 1 and ends at 677 in cDNA. 

- Clone 31675 starts at 3 and ends at 607 in cDNA. 
30 - Clone 266907 starts at 4 and ends at 677 in cDNA. 



35 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 494 

- Ceres seq_id 2053885 

- Location of start within SEQ ID NO 493: at 1 nt , 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
40 - Alignment No. 329 

- gi No. 2498177 

- Description: BASP1 PROTEIN 

- % Identity: 28 

- Alignment Length: 164 

45 - Location of Alignment in SEQ ID NO 494: from 2 to 

159 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 95 

- Ceres seq_id 2053886 

- Location of start within SEQ ID NO 493: at 52 nt , 
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10 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 330 

- gi No. 2498177 

- Description: BASP1 PROTEIN 

- % Identity: 28 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 495: from 1 to 

142 



15 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 496 

- Ceres seq_id 2053887 

- Location of start within SEQ 



ID NO 493: at 172 nt . 



20 



(C) 

Predicted 
(Dp) 



25 



Nomination and Annotation of Domains within 
Polypeptide ( s ) 
Related Amino Acid Sequences 

- Alignment No. 331 

- gi No. 2498177 

- Description: BASP1 PROTEIN 

- % Identity: 28 

- Alignment Length: 164 

- Location of Alignment in SEQ ID NO 496: from 1 to 



102 





Maximum Length 


Sequence : 








30 


Public Genomic 


DNA: 










gi No: 4887738 










Predicted 


Exons : 










INTR 


45004 ... 


44725 


OCKHAMG- 


■CDNA 




INTR 


44038 . . . 


43918 


OCKHAMG- 


■CDNA 


35 


INTR 


43839 ... 


43444 


OCKHAMG- 


•CDNA 




INIT 


44942 ... 


44725 


OCKHAMG- 


-CDS 




INTR 


44038 ... 


43918 


OCKHAMG- 


•CDS 




TERM 


43839 . . . 


43564 


OCKHAMG- 


-CDS 


40 


gi No: 5103850 










Predicted 


Exons : 










INIT 


44944 . . . 


44727 


GENBANK 






INTR 


44040 ... 


43920 


GENBANK 






TERM 


43841 . . . 


43566 


GENBANK 




45 














INTR 


45006 . . . 


44727 


OCKHAMG- 


•CDNA 




INTR 


44040 ... 


43920 


OCKHAMG- 


■CDNA 




INTR 


43841 ... 


43446 


OCKHAMG- 


•CDNA 


50 


INIT 


44944 ... 


44727 


OCKHAMG- 


•CDS 




INTR 


44040 . . . 


43920 


OCKHAMG- 


-CDS 
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TERM 43841 ... 43566 OCKHAMG-CDS 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 4 97 
5 - Ceres seq_id 2053908 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 498 
10 - Ceres seq_id 2053909 

- Location of start within SEQ ID NO 497: at 63 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
15 (Dp) Related Amino Acid Sequences 

- Alignment No. 332 

- gi No. 2245000 

- Description: (Z97341) LET1 like protein 
[Arabidopsis thai i an a ] 

20 - % Identity: 31.4 

- Alignment Length: 156 

- Location of Alignment in SEQ ID NO 498: from 17 

to 164 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 499 

- Ceres seq_id 2053910 

- Location of start within SEQ ID NO 497: at 168 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
30 ID NO 499: at 23 aa. 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
35 - Alignment No. 333 

- gi No. 2245000 

- Description: (Z97341) LET1 like protein 
[Arabidopsis t ha liana] 

- % Identity: 31.4 

4 0 - Alignment Length: 15 6 

- Location of Alignment in SEQ ID NO 499: from 1 to 

129 

(B) Polypeptide Sequence 
45 - Pat. Appln. SEQ ID NO 500 

- Ceres seq__id 2053911 

- Location of start within SEQ ID NO 497: at 216 nt . 

(C) Nomination and Annotation of Domains within 
50 Predicted Polypeptide ( s ) 

(Dp) Related Amino Acid Sequences 
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- Alignment No. 334 

- gi No. 2245000 

- Description: (Z97341) 
[ Arabidopsis thai i ana ] 

- % Identity: 31.4 

- Alignment Length: 156 

- Location of Alignment 

113 



LET1 like protein 



in SEQ ID NO 500: from 1 to 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4335711 
Predicted Exons : 
INTR 46239 ., 
INTR 46624 . , 
INTR 47125 . , 



4 6470 OCKHAMG-CDNA 
4 6739 OCKHAMG-CDNA 
4 7 42 8 OCKHAMG-CDNA 



INIT 
INTR 
TERM 



46285 
46624 
47125 



4 6470 OCKHAMG-CDS 
4 6739 OCKHAMG-CDS 
47413 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 501 

- Ceres seq_id 2056123 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 502 

- Ceres seq_id 2056124 

- Location of start within SEQ ID NO 501: at 47 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Zinc finger, C3HC4 type (RING finger) 

- Location within SEQ ID NO 502: from 150 to 190 

aa . 



(Dp) Related Amino Acid Sequences 

- Alignment No. 335 

- gi No. 3790593 

- Description: (AF079185) RING-H2 finger protein 
RHYla [Arabidopsis thaliana] 

- % Identity: 55.1 

- Alignment Length: . 4 9 

- Location of Alignment in SEQ ID NO 502: from 146 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 503 

- Ceres seq_id 2056125 

- Location of start within SEQ ID NO 501: at 56 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Zinc finger, C3HC4 type (RING finger) 

- Location within SEQ ID NO 503: from 147 to 187 

aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 336 

- gi No. 3790593 

- Description: (AF079185) RING-H2 finger protein 
RHYla [Arabidopsis thaliana] 

- % Identity: 55.1 

- Alignment Length: 4 9 

- Location of Alignment in SEQ ID NO 503: from 143 

to 191 

Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4432829 



Predicted 


Exons : 








INTR 


51185 . . 


51056 


OCKHAMG- 


CDNA 


INTR 


50590 . . 


. 50253 


OCKHAMG- 


CDNA 


INIT 


56176 .. 


. 55097 


OCKHAMG- 


•CDS 


INTR 


55014 . . 


. 54161 


OCKHAMG- 


CDS 


INTR 


54076 .. 


. 54011 


OCKHAMG- 


CDS 


INTR 


51530 . . 


51382 


OCKHAMG- 


■CDS 


INTR 


51187 . . 


51056 


OCKHAMG- 


CDS 


TERM 


50590 . . 


. 50418 


OCKHAMG- 


■CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 504 

- Ceres seq__id 2056245 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 505 

- Ceres seq_id 2056246 

- Location of start within SEQ ID NO 504: at 1 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 337 

- gi No. 4938484 

- Description: (AL078464) transcription factor-like 
protein [Arabidopsis thaliana] 

- % Identity: 56.2 

- Alignment Length: 661 
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- Location of Alignment in SEQ ID NO 505: from 4 to 

644 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 506 

- Ceres seq_id 2056247 

- Location of start within SEQ ID NO 504: at 55 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s ) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 338 

- gi No. 4938484 

- Description: (AL078464) transcription factor-like 
protein [Arabidopsis thaliana] 

- % Identity: 56.2 

- Alignment Length: 661 

- Location of Alignment in SEQ ID NO 506: from 1 to 

626 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 507 

- Ceres seq_id 2056248 

- Location of start within SEQ ID NO 504: at 751 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 339 

- gi No. 4938484 

- Description: (AL078464) transcription factor-like 
protein [Arabidopsis thaliana] 

- % Identity: 56.2 

- Alignment Length: 661 

- Location of Alignment in SEQ ID NO 507: from 1 to 

394 



Maximum Length Sequence: 

related to: 
Clone IDs: 

213322 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 508 

- Ceres seq_id 1941142 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 509 

- Ceres seq_id 1941143 

- Location of start within SEQ ID NO 508: at 119 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L24e 

- Location within SEQ ID NO 509: from 3 to 73 aa. 

5 

(Dp) Related Amino Acid Sequences 

- Alignment No. 34 0 

- gi No. 1710521 

- Description: 60S RIBOSOMAL PROTEIN L24 

10 >gi | 1154859 | emb | CAA63960 | (X94296) L24 ribosomal protein 
[Hordeum vulgare] 

- % Identity: 94.4 

- Alignment Length: 162 

- Location of Alignment in SEQ ID NO 509: from 1 to 



(B) Polypeptide Sequence 

- Pat:. Appln. SEQ ID NO 510 

- Ceres seq_id 1941144 

20 - Location of start within SEQ ID NO 508: at 284 nt . 



(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 341 

- gi No. 1710521 

- Description: 60S RIBOSOMAL PROTEIN L24 

>gi | 1154859 | emb | CAA63960 | (X94296) L24 ribosomal protein 
[Hordeum vulgare] 
30 - % Identity: 94.4 

- Alignment Length: 162 

- Location of Alignment in SEQ ID NO 510: from 1 to 

106 

35 Maximum Length Sequence: 
related to: 
Clone IDs: 

241379 

(Ac) cDNA Polynucleotide Sequence 
40 - Pat. Appln. SEQ ID NO 511 

- Ceres seq_id 1942975 

- Alternative transcription start site(s) located in SEQ 
ID NO 511: 

29, 30, 31, 32, 33,36 

45 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 512 

- Ceres seq_id 1942976 

- Location of start within SEQ ID NO 511: at 1 nt . 
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(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 512: from 149 to 233 

5 aa . 

(Dp) Related Amino Acid Sequences 

- Alignment No. 342 

- gi No. 133940 

10 - Description: 40S RIBOSOMAL PROTEIN S3A (S1A) 

>gi I 70851 | pir | I R3XL3A ribosomal protein S3a - African clawed 
frog >gi | 65091 I emb | CAA40592 | (X57322) ribosomal protein Sla 
[Xenopus laevis] 

- % Identity: 82.5 

15 - Alignment Length: 223 

- Location of Alignment in SEQ ID NO 512: from 46 

to 268 

(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 513 

- Ceres seq__id 1942977 

- Location of start within SEQ ID NO 511: at 136 nt . 

(C) Nomination and Annotation of Domains within 
25 Predicted Polypeptide ( s ) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 513: from 104 to 188 

aa . 

30 (Dp) Related Amino Acid Sequences 

- Alignment No. 343 

- gi No. 133940 

- Description: 40S RIBOSOMAL PROTEIN S3A (S1A) 

>gi I 70851 | pir | | R3XL3A ribosomal protein S3a - African clawed 
35 frog >gi | 65091 | emb I CAA40592 | (X57322) ribosomal protein Sla 
[Xenopus laevis] 

- % Identity: 82.5 

- Alignment Length: 223 

- Location of Alignment in SEQ ID NO 513: from 1 to 

40 223 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 514 

- Ceres seq__id 1942978 

45 - Location of start within SEQ ID NO 511: at 205 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S3, C-terminal domain. 

50 - Location within SEQ ID NO 514: from 81 to 165 aa. 
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(Dp) Related Amino Acid Sequences 

- Alignment No. 344 

- gi No. 133940 

- Description: 40S RIBOSOMAL PROTEIN S3A (S1A) 

5 >gi | 70851 | pir | | R3XL3A ribosomal protein S3a - African clawed 
frog >gi | 65091 | emb | CAA40592 I (X57322) ribosomal protein Sla 
[Xenopus laevis] 

- % Identity: 82.5 

- Alignment Length: 223 

10 - Location of Alignment in SEQ ID NO 514: from 1 to 

200 

Maximum Length Sequence: 
related to: 
15 Clone IDs: 

289536 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 515 

- Ceres seq_id 1944349 

2 0 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 516 

- Ceres seq_id 1944350 

25 - Location of start within SEQ ID NO 515: at 423 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein L24e 

30 - Location within SEQ ID NO 516: from 3 to 73 aa. 

(Dp) Related Amino Acid Sequences 

- Alignment No. 34 5 

- gi No. 1710521 

35 - Description: 60S RIBOSOMAL PROTEIN L24 

>gi I 1154859 I emb | CAA63960 | (X94296) L24 ribosomal protein 
[Hordeum vulgare] 

- % Identity: 94.4 

- Alignment Length: 162 

40 - Location of Alignment in SEQ ID NO 516: from 1 to 

161 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 517 
45 - Ceres seq_id 1944351 

- Location of start within SEQ ID NO 515: at 588 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
50 (Dp) Related Amino Acid Sequences 

- Alignment No. 34 6 
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- gi No. 1710521 

- Description: 60S RIBOSOMAL PROTEIN L24 

>gi I 1154859 | emb | CAA63960 | (X94296) L24 ribosomal protein 
[Hordeum vulgare] 
5 - % Identity: 94.4 

- Alignment Length: 162 

- Location of Alignment in SEQ ID NO 517: from 1 to 

106 



10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 518 

- Ceres seq_id 1944352 

- Location of start within SEQ ID NO 515: at 736 nt . 



15 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 347 

- gi No. 1362587 

20 - Description: spermatid-specif ic protein T2 

precursor - longfin squid 

- % Identity: 55.8 

- Alignment Length: 52 

- Location of Alignment in SEQ ID NO 518: from 1 to 



Maximum Length Sequence: 

related to: 
Clone IDs: 
30 291258 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 519 

- Ceres seq_id 1964011 

- Alternative transcription start site(s) located in SEQ 
35 ID NO 519: 

4, 34, 37, 38, 39, 40,41 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 520 
40 - Ceres seq__id 1964012 

- Location of start within SEQ ID NO 519: at 3 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 
45 - Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 520: from 150 to 234 

aa . 



50 



(Dp) Related Amino Acid Sequences 

- Alignment No. 348 

- gi No. 1173253 
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- Description: 40S RIBOSOMAL PROTEIN S3 
>gi I 543317 | pir | | S4 1170 ribosomal protein S3 - mouse 

>gi I 57728 | emb | CAA35916 | (X51536) ribosomal protein S3 (AA 1- 
243) [Rattus rattus] >gi | 4 39522 | emb | CAA54 1 67 | (X76772) 
5 ribosomal protein S3 [Mus musculus] 

- % Identity: 81.1 

- Alignment Length: 227 

- Location of Alignment in SEQ ID NO 520: from 47 

to 273 

10 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 521 

- Ceres seq_id 1964013 

- Location of start within SEQ ID NO 519: at 141 nt . 

15 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 521: from 104 to 188 

20 aa. 



(Dp) Related Amino Acid Sequences 

- Alignment No. 34 9 

- gi No. 1173253 

25 - Description: 40S RIBOSOMAL PROTEIN S3 

>gi I 543317 |pir | | S41170 ribosomal protein S3 - mouse 
>gi I 57728 | emb | CAA35916 | (X51536) ribosomal protein S3 (AA 1- 
243) [Rattus iattus] >gi I 439522 | emb | CAA54167 | (X76772) 
ribosomal protein S3 [Mus musculus] 

30 - % Identity: 81.1 

- Alignment Length: 227 

- Location of Alignment in SEQ ID NO 521: from I to 

227 

35 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 522 

- Ceres seq_id 1964014 

- Location of start within SEQ ID NO 519: at 210 nt . 



40 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

- Ribosomal protein S3, C-terminal domain. 

- Location within SEQ ID NO 522: from 81 to 165 aa. 



45 (Dp) Related Amino Acid Sequences 

- Alignment No. 350 

- gi No. 1173253 

- Description: 40S RIBOSOMAL PROTEIN S3 
>gi I 543317 | pir M S41170 ribosomal protein S3 - mouse 

50 >gi | 57728 | emb | CAA35916 | (X51536) ribosomal protein S3 (AA 1- 
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243) [Rattus rattus] >gi I 439522 | emb | CAA54167 | (X76772) 
ribosomal protein S3 [Mus musculus] 

- % Identity: 81.1 

- Alignment Length: 227 t 

5 - Location of Alignment in SEQ ID NO 522: from 1 to 

204 

Maximum Length Sequence : 
related to: 
10 Clone IDs: 

207986 

224937 

227089 

229580 
15 241662 

275880 

278084 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 523 
20 - Ceres seq_id 1983854 

- Alternative transcription start site(s) located in SEQ 
ID NO 523: 

-1,2, 6, 30, 31, 32, 33, 34, 36 

- Clone 207986 starts at 31 and ends at 810 in cDNA. 
25 - Clone 224937 starts at 31 and ends at 810 in cDNA. 

- Clone 227089 starts at 33 and ends at 804 in cDNA. 

- Clone 229580 starts at 31 and ends at 821 in cDNA. 

- Clone 241662 starts at 34 and ends c 815 in cDNA. 

- Clone 275880 starts at 1 and ends at 804 in cDNA. 
30 - Clone 278084 starts at 31 and ends at 815 in cDNA. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 524 

- Ceres seq_id 1983855 

35 - Location of start within SEQ ID NO 523: at 315 nt . 

(C) Nomination and Annotation of Domains within 
Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 
40 - Alignment No. 351 

- gi No. 1710521 

- Description: 60S RIBOSOMAL PROTEIN L24 

>gi I 1154859 | emb | CAA63960 | (X94296) L24 ribosomal protein 
[Hordeum vulgare] 
45 - % Identity: 94 

- Alignment Length: 150 

- Location of Alignment in SEQ ID NO 524: from 1 to 

106 



50 Maximum Length Sequence: 
related to: 
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Clone IDs: 

221977 
226126 
293001 

5 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 525 

- Ceres seq_id 1990261 

- Alternative transcription start - site ( s ) located in SEQ 
ID NO 52 5: 

10 16,30,32 

- Clone 226126 starts at 30 and ends at 772 in cDNA. 

- Clone 293001 starts at 32 and ends at 810 in cDNA. 

(B) Polypeptide Sequence 
15 - Pat. Appln. SEQ ID NO 526 

- Ceres seq_id 2061972 

- Location of start within SEQ ID NO 525: an 144 nt . 

(C) Nomination and Annotation of Domains within 
20 Predicted Polypeptide ( s ) 

- Ribosomal protein L24e 

- Location within SEQ ID NO 526: from 3 to 73 aa. 

(Dp) Related Amino Acid Sequences 
25 - Alignment No. 352 

- gi No. 1710521 

- Description: 60S RIBOSOM7L PROTEIN L24 

>gi I 1154859 I emb | CAA63960 | (X94296) L2^ ribosomal protein 
[Hordeum vulgare] 
30 - % Identity: 94.4 

- Alignment Length: 162 

- Location of Alignment in SEQ ID NO 526: from 1 to 

161 

35 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 527 

- Ceres seq_id 2061973 

- Location of start within SEQ ID NO 525: at 309 nt . 

40 (C) Nomination and Annotation of Domains within 

Predicted Polypeptide (s) 

(Dp) Related Amino Acid Sequences 

- Alignment No. 353 

- gi No. 1710521 

4 5 - Description: 60S RIBOSOMAL PROTEIN L24 

>gi I 1154859 | emb | CAA63960 | (X94296) L24 ribosomal protein 
[Hordeum vulgare] 

- % Identity: 94.4 

- Alignment Length: 162 

50 - Location of Alignment in SEQ ID NO 527: from 1 to 

106 
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REF TABLE 2 

Maximum Length Sequence: 
related to: 
5 Clone IDs: 
9458 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 1 

- Ceres seq_id 1007802 

10 - Alternative transcription start site(s) located in SEQ 

ID NO 1: 

2,3,4 



(B) Polypeptide Sequence 
15 - Pat. Appln. SEQ ID NO 2 

- Ceres seq_id 1007803 

- Location of start within SEQ ID NO 1: at 3 nt . 



(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 3 

- Ceres seq_id 1007804 

- Location of stan within SEQ ID NO 1: at 2 nt. 

- Location of Signal Pepride Cleavage Site within SEQ 
ID NO 3: at 29 aa . 

25 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 4 

- Ceres seq_id 1007805 

- Location of start within SEQ ID NO 1: at 29 nt . 

30 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 4 : at 20 aa . 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

35 specific gene, plant 

specific gene. 

Maximum Length Sequence: 
related to: 
40 Clone IDs: 
8458 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 5 

- Ceres seq_id 1008556 

45 - Alternative transcription start site(s) located in SEQ 

ID NO 5: 

80, 103, 134, 135 ,136, 137 ,138, 139, 152, 153, 157, 166, 168 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 6 

- Ceres seq_id 1008557 
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- Location of start within SEQ ID NO 5: at 173 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

5 specific gene, plant 

specific gene. 

Maximum Length Sequence: 
related to: 
10 Clone IDs: 
8384 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 7 

- Ceres seq_id 1008628 

15 - Alternative transcription start site(s) located in SEQ 

ID NO 7 : 

3, 28 

(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 8 

- Ceres seq_id 1008629 

- Location of start within SEQ ID NO 7: at 41 nt . 

(Ba) Polypeptide Activities: Arabidopsis specific 
25 gene, dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 
30 related to: 
Clone IDs: 
7260 

Public Genomic DNA: 
gi No: 5733889 



35 


Predicted Exons : 










INIT 21286 . . . 


21509 


GENB7ANK 






INTR 21606 . . . 


21832 


GENBANK 






TERM 21951 . . . 


21958 


GENBANK 




40 


INIT 21286 . . . 


21509 


OCKHAMG- 


-CDS 




INTR 21606 . . . 


21832 


OCKHAMG- 


■CDS 




TERM 21951 . . . 


21958 


OCKHAMG- 


■CDS 




gi No: 6041810 










Predicted Exons : 








45 


INIT 34915 . . . 


35138 


OCKHAMG- 


-CDS 




INTR 35235 . . . 


35461 


OCKHAMG- 


■CDS 




TERM 35580 . . . 


35587 


OCKHAMG- 


■CDS 




gi No: 6091711 










Predicted Exons : 








50 


INIT 100255. . . 


100478 


OCKHAMG- 


•CDS 




INTR 100575... 


100801 


OCKHAMG- 


-CDS 
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(Ac) 



TERM 100920 
gi No: 6102641 



Predicted 
INIT 
INTR 
TERM 



Exons : 
91941 
92261 
92606 



gi No: 6453849 
Predicted Exons: 

INIT 19658 . . , 
INTR 19338 . . , 
TERM 18993 . . , 
cDNA Polynucleotide 



295 
100927 



92164 
92487 
92613 



19435 
19112 
18986 
Sequence 



- Pat. Appln. SEQ ID NO 9 

- Ceres seq_id 1009376 

- Alternative transcription 
ID NO 9: 

4, 7, 22, 48, 56,77 



OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



start site(s) located in SEQ 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 10 

- Ceres seq_id 1009377 

- Location of start within SEQ ID NO 9: at 238 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 10: at 76 aa . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 11 

- Ceres seq_id 1009378 

- Location of start within SEQ ID NO 9: at 334 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 11: at 44 aa . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 12 

- Ceres seq_id 1009379 

- Location of start within SEQ ID NO 9: at 245 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 12: at 47 aa . 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
4489 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 13 
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- Ceres seq_id 1011128 

- Alternative transcription start site(s) located in SEQ 
ID NO 13: 

6,7 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 14 

- Ceres seq_id 1011129 

- Location of start within SEQ ID NO 13: at 41 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 15 

- Ceres seq_id 1011130 

- Location of start within SEQ ID NO 13: at 59 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
42241 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 16 

- Ceres seq_id 1011718 

(B) Polypepride Sequence 

- Pat. Appln. SEQ ID NO 17 

- Ceres seq_id 1011719 

- Location of start within SEQ ID NO 16: at 3 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 18 

- Ceres seq_id 1011720 

- Location of start within SEQ ID NO 16: at 78 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 18 : ar 21 aa . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 19 

- Ceres seq_id 1011721 

- Location of start within SEQ ID NO 16: at 102 nt. 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

specific gene. 
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Maximum Length Sequence: 
related to: 
5 Clone IDs: 
42200 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 20 
10 - Ceres seq__id 1011735 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 21 

- Ceres seq_id 1011736 

15 - Location of stari within SEQ ID NO 20: at 75 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 22 

- Ceres seq_id 1011737 

20 - Location of start within SEQ ID NO 20: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 22 : at 37 aa. 

(B) Polypeptide Sequence 
25 - Pat. Appln. SEQ ID NO 23 

- Ceres seq_id 1011738 

- Location of start within SEQ ID NO 20: at 2 nt . 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
30 dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 
35 related to: 
Clone IDs: 
42217 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 24 
40 - Ceres seq_id 1011755 

- Alternative transcription start site(s) located in SEQ 
ID NO 24: 

34 

45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 25 

- Ceres seq_id 1011756 

- Location of start within SEQ ID NO 24: at 1 nt. 



50 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 26 
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- Ceres seq_id 1011757 

- Location of start within SEQ ID NO 24: at 19 nr. 

(Ba) Polypeptide Activities: Arabidopsis specific 
5 gene, dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 
10 related to: 
Clone IDs: 
42034 

(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 27 
15 - Ceres seq_id 1011832 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 28 

- Ceres seq_id 1011833 

20 - Location of srarr within SEQ ID NO 27: at 97 nr. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 29 

- Ceres seq_id 1011834 

25 - Location of start within SEQ ID NO 27: at 100 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 30 

- Ceres seq_id 1011835 

30 - Location of start within SEQ ID NO 27: at 106 nt . 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

35 specific gene. 



Maximum Length Sequence: 
related to: 
40 Clone IDs: 
41812 

Public Genomic DNA: 
gi No: 6434227 
Predicted Exons : 
45 INIT 65854 . . . 65974 OCKHAMG-CDS 

TERM 66067 . . . 66206 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 31 

- Ceres seq_id 1011907 



(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID; NO 32 

- Ceres seq_id 1011908 

- Location of start within SEQ ID NO 31: at 218 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 32 : at 30 aa . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 33 

- Ceres seq_id 1011909 

- Location of start within SEQ ID NO 31: "at 239 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 33: at 23 aa. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 34 

- Ceres seq__id 1011910 

- Location of start within SEQ ID NO 31: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 34 : at 17 aa . 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence 

related to: 
Clone IDs: 
41828 

Public Genomic DNA: 
gi No: 4159707 
Predicted Exons : 
INTR 
INTR 
INTR 
INTR 



15084 
15295 
15414 
15648 



(Ac) 



15201 
. 15324 
15533 
15956 

cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 35 

- Ceres seq_id 1011911 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 36 

- Ceres seq_id 1011912 

- Location of start within SEQ ID NO 35: at 194 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 37 

- Ceres seq_id 1011913 

- Location of start within SEQ ID NO 35: at 230 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 38 

- Ceres seq_id 1011914 

5 - Location of start within SEQ ID NO 35: at 2 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 38 : at 28 aa. 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 
10 dicot 

specific gene, plant 

specific gene. 



15 Maximum Length Sequence: 
related to: 
Clone IDs: 

41723 ' 

(Ac) cDNA Polynucleotide Sequence 
20 - Pat. Appln. SEQ ID NO 39 

- Ceres seq_id 1011954 



(B) Polypeptide Sequence 
25 - Pat. Appln. SEQ ID NO 40 

- Ceres seq_id 1011955 

- Location of start within SEQ ID NO 39: at 1 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 40: at 40 aa. 

30 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 41 

- Ceres seq_id 1011956 

- Location of start within SEQ ID NO 39: at 107 nt . 

35 

(Ba) Polypeptide Activities: Similar to DNAJ Protein 
Activities 



Maximum Length Sequence: 
40 related to: 
Clone IDs: 

41730 
Public Genomic DNA: 
gi No: 4678371 
45 Predicted Exons : 

SINGLE 48996 ... 49184 GENBANK 



SINGLE 48996 ... 49184 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 
50 - Pat. Appln. SEQ ID NO 42 

- Ceres seq_id 1011960 
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- Alternative transcription start site(s) located in SEQ 
ID NO 42: 

20,22,25,43 

5 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 43 

- Ceres seq_id 1011961 

- Location of start within SEQ ID NO 42: at 69 nt . 

10 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 44 

- Ceres seq_id 1011962 

- Location of start within SEQ ID NO 42: at 78 nt . 

15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 45 

- Ceres seq_id 1011963 

- Location of start within SEQ ID NO 42: at 90 nt . 

20 (Ba) Polypeptide Activities: Arabidopsis specific 

gene, dicot 

specific gene, plant specific gene, similar to 
serine rich protein 

activities. 

25 

Maximum Length Sequence: 

related to: 
Clone IDs: 
38977 

30 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 46 

- Ceres seq_id 1014075 

- Alternative transcription start site(s) located in SEQ 
ID NO 46: 

35 -3,-2,-1,2,3,4,5,6,7,8,10,15,103 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 47 

- Ceres seq_id 1014076 

40 - Location of start within SEQ ID NO 46: at 1 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 48 

- Ceres seq_id 1014077 

45 - Location of start within SEQ ID NO 46: at 84 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 48: at 22 aa. 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
50 dicot 
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specific gene, plant 

specific gene. 

Maximum Length Sequence: 
5 related to: 
Clone IDs: 

37081 
Public Genomic DNA: 
gi No: 4159707 
10 Predicted Exons : 

INIT 15458 ... 15533 OCKHAMG-CDS 
TERM 15648 ... 15751 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 49 
15 - Ceres seq_id 1015865 

- Alternative transcription start site(s) located in SEQ 
ID NO 49: 

-29 

20 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 50 

- Ceres seq__id 1015866 

- Location of start within SEQ ID NO 49: at 281 nt . 

25 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 51 

- Ceres seq_id 1015867 

- Location of start within SEQ ID NO 49: at 317 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
30 ID NO 51: at 22 aa. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 52 

- Ceres seq_id 1015868 

35 - Location of start within SEQ ID NO 49: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 52: at 15 aa . 

(Ba) Polypeptide Activities: Arabidopsis specific 
40 gene, dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 
45 related to: 
Clone IDs: 

29375 
Public Genomic DNA: 
gi No: 4159707 
50 Predicted Exons: 

INTR 15158 ... 15201 OCKHAMG-CDNA 
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INTR 15449 ... 15533 OCKHAMG-CDNA 
INTR 15648 ... 15947 OCKHAMG-CDNA 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 53 

- Ceres seq_id 1021371 

- Alternative transcription start site(s) located in SEQ 
ID NO 53: 

-74 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 54 

- Ceres seq_id 1021372 

- Location of start within SEQ ID NO 53: at 3 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 55 

- Ceres seq_id 1021373 

- Location of start within SEQ ID NO 53: at 54 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 56 

- Ceres seq_id 1021374 

- Location of start within SEQ ID NO 53: at 90 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 56: at 22 aa. 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
27064 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 57 

- Ceres seq_id 1022578 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 58 

- Ceres seq_id 1022579 

- Location of start within SEQ ID NO 57: at 1 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 59 

- Ceres seq_id 1022580 

- Location of start within SEQ ID NO 57: at 157 nt . 



(B) 



Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 60 

- Ceres seq_id 1022581 

- Location of start within 



SEQ ID NO 57: at 10 nt 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence: 
related to: 

Clone IDs: 
23773 

Public Genomic DNA: 
gi No: 6136357 
Predicted Exons : 



(Ac) 



INTR 
INTR 
INTR 
INTR 
INTR 
No: 



51972 
51129 
50948 
50706 
50326 
6143858 



Predicted 
INTR 
INTR 
INTR 
INTR 
INTR 



Exons : 

51972 

51129 

50948 

50706 

50326 



cDNA Polynucleotide S 

- Pat. Appln. SEQ ID 

- Ceres seq_id 102424 



51830 
51055 
50806 
50608 
50147 



51830 
51055 
50806 
50608 
50147 

equence 

NO 61 

0 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 62 

- Ceres seq_id 1024241 

- Location of start within SEQ ID NO 61: at 54 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 63 

- Ceres seq_id 1024242 

- Location of start within SEQ ID NO 61: at 66 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 64 

- Ceres seq_id 1024243 

- Location of start within SEQ ID NO 61: at 404 nt. 

(Ba) Polypeptide Activities: Arabidopsis specific 
gene, dicot 

specific gene, plant 

specific gene. 
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Maximum Length Sequence: 

related to: 
Clone IDs: 
2030 

Public Genomic DNA: 
gi No: 5441914 
Predicted Exons: 

SINGLE 4802 
gi No: 5881519 
Predicted Exons: 

SINGLE 4802 



4449 



4449 



OCKHAMG-CDS 



GENBANK 



SINGLE 4802 ... 4449 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 65 

- Ceres seq_id 1026562 

- Alternative transcription start site(s) 
ID NO 65: 

36, 41, 43, 44,88, 90, 94, 102 



located in SEQ 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 66 

- Ceres seq_id 1026563 

- Location of start within SEQ ID NO 65: at 1 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 66: at 51 aa. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 67 

- Ceres seq_id 1026564 

- Location of start within SEQ ID NO 65: at 109 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 67: at 15 aa. 

(Ba) Polypeptide Activities: Similar to cytochrome c 
oxidase II 

activities . 



Maximum Length Sequence: 

related to: 
Clone IDs: 

20097 
Public Genomic DNA: 
gi No: 5733893 
Predicted Exons: 

INTR 66948 ... 67429 OCKHAMG-C DNA 



SINGLE 66986 ... 67324 OCKHAMG-CDS 
gi No: 5748882 
Predicted Exons: 
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INTR 41416 ... 40935 OCKHAMG-CDNA 

SINGLE 41380 ... 41042 OCKHAMG-CDS 
gi No: 5801671 
Predicted Exons : 

INTR 51184 ... 50703 OCKHAMG-CDNA 



SINGLE 51148 
gi No: 5809708 
Predicted Exons: 

INTR 41416 



50810 OCKHAMG-CDS 



4 0 935 OCKHAMG- C DNA 



SINGLE 41380 . . . 
gi No: 5836114 
Predicted Exons: 

INTR 42487 . . . 

SINGLE 42451 . . . 
gi No: 5923662 
Predicted Exons: 

INTR 43106 . . . 



41042 OCKHAMG-CDS 

42 0 0 6 OCKHAMG - C DNA 

42113 OCKHAMG-CDS 

4 358 7 OCKHAMG - C DNA 



SINGLE 43144 ... 43482 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 68 

- Ceres seq_id 1026648 

- Alternative transcription start site(s) located in SEQ 
ID NO 68: 

-74, -72, -48, -42, 76 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 69 

- Ceres seq_id 1026649 

- Location of start within SEQ ID NO 68: at 39 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 69: at 59 aa. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 70 

- Ceres seq__id 1026650 

- Location of start within SEQ ID NO 68: at 78 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 70: at 46 aa. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 71 

- Ceres seq__id 1026651 

- Location of start within SEQ ID NO 68: at 81 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 71: at 45 aa . 
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(Ba) Polypeptide Activities: Arabidopsis specific 
gene, dicot 

specific gene, plant 

specific gene. 

5 

Maximum Length Sequence: 

related to: 
Clone IDs: 
18274 

10 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 72 

- Ceres seq_id 1027881 

- Alternative transcription start site(s) located in SEQ 
ID NO 72: 

15 -8,-4,2 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 73 

- Ceres seq_id 1027882 

20 - Location of start within SEQ ID NO 72: at 3 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 74 

- Ceres seq_id 1027883 

25 - Location of start within SEQ ID NO 72: at 36 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 75 

- Ceres seq_id 1027884 

30 - Location of start within SEQ ID NO 72: at 105 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 75: at 18 aa. 

(Ba) Polypeptide Activities: Arabidopsis specific 
35 gene, dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 
40 related to: 
Clone IDs: 

121894 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 7 6 
45 - Ceres seq__id 1381797 

- Alternative transcription start site(s) located in SEQ 
ID NO 76: 

-41,18, 19, 22, 26 



50 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 77 
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- Ceres seq__id 1381798 

- Location of start within SEQ ID NO 76: at 70 nt . 

(B) Polypeptide Sequence 
5 - Pat. Appln. SEQ ID NO 78 

- Ceres seq_id 1381799 

- Location of start within SEQ ID NO 76: at 82 nt. 

(B) Polypeptide Sequence 
10 - Pat. Appln. SEQ ID NO 79 

- Ceres seq_id 1381800 

- Location of start within SEQ ID NO 76: at 109 nt . 

(Ba) Polypeptide Activities: Arabidopsis specific 
15 gene, dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 
20 related to: 
Clone IDs: 
31667 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 80 
25 - Ceres seq_id 1442747 

- Alternative transcription start site(s) located in SEQ 
ID NO 80: 

3, 5, 6, 11, 62, 64 ,65 ,66,69,70,71, 72 ,74 

30 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 81 

- Ceres seq_id 1442748 

- Location of start within SEQ ID NO 80: at 2 nt . 

35 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 82 

- Ceres seq_id 1442749 

- Location of start within SEQ ID NO 80: at 140 nt . 

40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 83 

- Ceres seq_id 1442750 

- Location of start within SEQ ID NO 80: at 224 nt . 

45 (Ba) Polypeptide Activities: Arabidopsis specific 

gene, dicot 

specific gene, plant 

specific gene. 



50 Maximum Length Sequence: 
related to: 



WO 00/40695 



PCT/US00/00466 



309 

Clone IDs: 

270354 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 84 

- Ceres seq__id 1459199 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 85 

- Ceres seq_id 1459200 

- Location of start within SEQ ID NO 84: at 2 nt. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 86 

- Ceres seq_id 1459201 

- Location of start within SEQ ID NO 84: at 56 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 86: at 20 aa . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 87 

- Ceres seq_id 1459202 

- Location of start within SEQ ID NO 84: at 80 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 87: at 17 aa. 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 
dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
12250 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 88 

- Ceres seq_id 1565605 

- Alternative transcription start site(s) located in SEQ 
ID NO 8 8: 

-17 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 89 

- Ceres seq_id 1565606 

- Location of start within SEQ ID NO 88: at 133 nt . 



(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 90 

- Ceres seq_id 1565607 

- Location of start within SEQ ID NO 88: at 181 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicer 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
97883 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 91 

- Ceres seq_id 1566686 

- Alternative transcription start site(s) located in SEQ 
ID NO 91: 

58, 61 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 92 

- Ceres seq_id 1566687 

- Location of start within SEQ ID NO 91: at 137 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
12408 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 93 

- Ceres seq_id 1567367 

- Alternative transcription start site(s) located in SEQ 
ID NO 93: 

-39, 6, 11, 14, 17, 25, 32, 33, 34, 35, 60 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 94 

- Ceres seq_id 1567368 

- Location of start within SEQ ID NO 93: at 1 nt. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 95 

- Ceres seq_id 1567369 

- Location of start within SEQ ID NO 93: at 2 nt. 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 96 

- Ceres seq__id 1567370 

- Location of start within 



SEQ ID NO 93: at 65 nt , 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 

specific gene, plant 



dicot 
specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 
19528 

Public Genomic DNA: 
gi No: 6091770 
Predicted Exons : 

INIT 

INTR 

INTR 

TERM 
gi No: 6102636 
Predicted Exons 

INIT 

INTR 

INTR 

TERM 
gi No: 6137907 
Predicted Exons : 

INIT 

INTR 

INTR 

TERM 
gi No: 6437519 
Predicted Exons 

INIT 

INTR 

INTR 

TERM 

(Ac) cDNA Polynucleotide 

- Pat. Appln. SEQ I 

- Ceres seq_id 1570 

- Alternative trans 
ID NO 97: 

9, 29, 30, 31, 32, 39, 



21323 
21467 
21995 
22386 



21323 
21467 
21995 
22386 



21252 
21396 
21924 
22315 



14107 
14251 
14779 
15170 



21390 
21564 
22027 
22468 



21390 
21564 
22027 
22468 



21319 
21493 
21956 
22397 



14174 
14348 
14811 
15252 
Sequence 

D NO 97 

101 

cript ion 



OC KHAMG -CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



start site(s) located in SEQ 
40, 43, 44, 61, 62, 64, 65, 198 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 98 

- Ceres seq_id 1570102 

- Location of start within 



SEQ ID NO 97: at 116 nt 
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(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 

5 

Maximum Length Sequence: 
related to: 
Clone IDs: 

42399' 

10 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 99 

- Ceres seq_id 1571051 



15 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 100 

- Ceres seq_id 1571052 

- Location of Starr within SEQ ID NO 99: at 1 nt . 



20 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 101 

- Ceres seq_id 1571053 

- Location of start within SEQ ID NO 99: at 16 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
25 ID NO 101: at 30 aa. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 102 

- Ceres seq_id 1571054 

- Location of start within SEQ ID NO 99: at 139 nt. 

30 

(Ba) Polypeptide Activities: Arabidopsis specific 
gene, dicot 

specific gene, plant 

specific gene. 

3 5 

Maximum Length Sequence: 

related to: 
Clone IDs: 
6487 

40 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 103 

- Ceres seq__id 1571100 

- Alternative transcription start site(s) located in SEQ 
ID NO 103: 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 104 

- Ceres seq_id 1571101 

50 - Location of start within SEQ ID NO 103: at 1 nt. 
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- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 104 : at 19 aa . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 105 

- Ceres seq_id 1571102 

- Location of start within SEQ ID NO 103: at 82 nt . 

(Ba) Polypeptide Activities: Arabidopsis specific 
gene , . dicor 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 

39977 

14890 

34623 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 106 

- Ceres seq_id 1665272 

- Alternative transcription start site(s) located in SEQ 
ID NO 106: 

3,15, 32 ,33 ,34, 35, 37 ,38,39,40,41, 42 ,43,44,45, 46,47,48,49, 50 
51, 60, 61, 63, 66, 69,70, 71, 72, 74, 75, 76,78,79,84,85 

- Clone 39977 starts at 35 and ends at 636 in cDNA. 

- Clone 14890 starts at 43 and ends at in cDNA. 

- Clone 34623 starts at 1 and ends at 598 in cDNA. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 107 

- Ceres seq__id 1665273 

- Location of start within SEQ ID NO 106: at 90 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 

Maximum Length Sequence: 

related to: 
Clone IDs: 
24562 

(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 108 

- Ceres seq_id 1713895 

- Alternative transcription start site(s) located in SEQ 
ID NO 108 : 

-7, 2, 3, 5, 7, 8, 13, 17, 2 9, 30, 32, 34 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 109 

- Ceres seq_id 1713896 

- Location of start within SEQ ID NO 108: at 3 nt . 

5 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 110 

- Ceres seq_id 1713897 

- Location of start within SEQ ID NO 108: at 57 nt . 
10 - Location of Signal Peptide Cleavage Site within SEQ 

ID NO 110: at 20 aa . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 111 
15 - Ceres seq_id 1713898 

- Location of start within SEQ ID NO L08: at 81 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 111 : at 17 aa . 

20 (Ba) Polypeptide Activities: Arabidcpsis specific gene, 

dicot 

specific gene, plant 

specific gene. 

25 Maximum Length Sequence: 
related to: 
Clone IDs: 
41879 

(Ac) cDNA Polynucleotide Sequence 
30 - Pat. Appln. SEQ ID NO 112 

- Ceres seq_id 1923752 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 113 
35 - Ceres seq_id 1923753 

- Location of start within SEQ ID NO 112: at 2 nt. 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 113: at 19 aa . 

40 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 114 

- Ceres seq__id 1923754 

- Location of start within SEQ ID NO 112: at 292 nt . 

45 (B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 115 

- Ceres seq_id 1923755 

- Location of start within SEQ ID NO 112: at 384 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
50 ID NO 115: at 27 aa . 
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(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 

specific gene, plant 

specific gene. 

5 

Maximum Length Sequence: 

related to: 
Clone IDs: 

150069 

10 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 116 

- Ceres seq_id 1976816 

- Alternative transcription start site(s) located in SEQ 
ID NO 116: 

15 20,30,32 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 117 

- Ceres seq_id 1976817 

20 - Location of start within SEQ ID NO 116: at 2 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 117: at 61 aa. 



(B) Polypeptide Sequence 
25 - Pat. Appln. SEQ ID NO 118 

- Ceres seq_id 1976818 

- Location of start within SEQ ID NO 116: at 3 nt . 

(B) Polypeptide Sequence 
30 - Pat. Appln. SEQ ID NO 119 

- Ceres seq_id 1976819 

- Location of start within SEQ ID NO 116: at 52 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

35 dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence: 
40 related to: 

Clone IDs: 

109997 
Public Genomic DNA: 
gi No: 4263813 
45 Predicted Exons : 

INIT 82737 ... 82740 OCKHAMG-CDS 
TERM 83475 ... 83623 OCKHAMG-CDS 
(Ac) cDNA Polynucleotide Sequence 
- Pat. Appln. SEQ ID NO 120 
50 - Ceres seq_id 2025128 
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- Alternative transcription start site(s) located in SEQ 
ID NO 120: 

-5,5, 61, 88 

- Clone 109997 starts at 5 and ends at 566 in cDNA . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 121 

- Ceres seq_id 2025129 

- Location of start within SEQ ID NO 120: at 1 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 122 

- Ceres seq_id 2025130 

- Location of start within SEQ ID NO 120: at 3 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 122: at 21 aa . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 123 

- Ceres seq_id 2025131 

- Location of start within SEQ ID NO 120: at 316 nt . 

(Ba) Polypeptide Activities: Arabidopsis specific 
gene, dicot 

specific gene, plant 

specific gene. 



Maximum Length Sequence 

related to: 
Clone IDs: 

41792 
Public Genomic DNA: 
gi No: 5762549 
Predicted Exons : 

INTR 41272 . 
gi No: 5776585 
Predicted Exons: 

INTR 55175 . 
gi No: 5801669 
Predicted Exons: 

INTR 97802 . 
gi No: 5809709 
Predicted Exons: 

INTR 97803 . 
gi No: 5932531 
Predicted Exons: 

INTR 90652 . 
gi No: 6013612 
Predicted Exons: 

INTR 9143 
gi No: 6016671 



417 4 9 OCKHAMG-CDNA 

5 5 652 OCKHAMG-CDNA 

9827 9 OCKHAMG-CDNA 

9 82 8 0 OCKHAMG-CDNA 

9112 9 OCKHAMG- C DNA 

9 62 0 OCKHAMG-C DNA 
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Predicted Exons : 

INTR 9266 ... 9743 OCKHAMG-CDNA 
gi No: 6041828 
Predicted Exons: 

INTR 9266 ... 9743 OCKHAMG-CDNA 
gi No: 6175159 
Predicted Exons: 

INTR 92644 ... 92167 OCKHAMG-CDNA 
(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 124 

- Ceres seq_id 2025402 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 125 

- Ceres seq_id 2025403 

- Location of start within SEQ ID NO 124: at 186 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 126 

- Ceres seq_id 2025404 

- Location of start within SEQ ID NO 124: at 2 nt . 



(Ba) Polypeptide Activities: Similar to ribose 5- 
phosphate isomerase 

Maximum Length Sequence: 

related to: 
Clone IDs: 

28572 
Public Genomic DNA: 

gi No: 4725940 

Predicted Exons: 



INIT 


99197 . . . 


99167 


OCKHAMG- 


•CDS 


INTR 


99060 . . . 


98960 


OCKHAMG- 


-CDS 


INTR 


98863 ... 


98753 


OCKHAMG- 


•CDS 


INTR 


98654 ... 


97823 


OCKHAMG- 


•CDS 


INTR 


97730 . . . 


97477 


OCKHAMG- 


•CDS 


INTR 


96772 . . . 


96683 


OCKHAMG- 


-CDS 


TERM 


96157 . . . 


96086 


OCKHAMG- 


•CDS 


No: 5823567 








dieted 


Exons : 








INIT 


99197 . . . 


99167 


OCKHAMG- 


•CDS 


INTR 


99060 ... 


98960 


OCKHAMG- 


•CDS 


INTR 


98863 ... 


98753 


OCKHAMG- 


CDS 


INTR 


98654 ... 


97823 


OCKHAMG- 


•CDS 


INTR 


97730 . . . 


97477 


OCKHAMG- 


CDS 


INTR 


96772 . . . 


96683 


OCKHAMG- 


CDS 


TERM 


96157 . . . 


96086 


OCKHAMG- 


CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 127 

- Ceres seq_id 2025479 
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- Alternative transcription start site(s) located in SEQ 
ID NO 127 : 

1419 

- Clone 28572 starts at 1419 and ends at 1955 in cDNA. 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 128 

- Ceres seq_id 2025480 

- Location of start within SEQ ID NO 127: at 1 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 129 

- Ceres seq_id 2025481 

- Location of start within SEQ ID NO 127: at 226 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 130 

- Ceres seq__id 2025482 

- Location of start within SEQ ID NO 127: at 271 nt . 



(Ba) Polypeptide Activities: Similar Cell Wall Plasma 
Membrane disconnecting CLCT Protein activities, and proline 
rich protein activities. 



Maximum Length Sequence; 

Public Genomic DNA: 
gi No: 6 "34247 
Predicted Exons : 
INTR 46663 

SINGLE 46579 . . 



4 6055 OC KHAMG- C DNA 
4 6103 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 131 

- Ceres seq_id 2032963 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 132 

- Ceres seq_id 2032964 

- Location of start within SEQ ID NO 131: at 85 nt 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 

specific gene, plant 



dicot 
specific gene 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 6449509 
Predicted Exons: 
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INIT 32110 . . . 

INTR 32363 . . . 

INTR 33006 . . . 

TERM 33182 . . . 
gi No: 6453848 
Predicted Exons : 

INIT 34795 . . . 

INTR 35048 . . . 

INTR 35691 . . . 

TERM 35867 . . . 
gi No: 6456153 
Predicted Exons: 

INIT 32305 . . . 

INTR 32558 . . . 

INTR 33201 . . . 

TERM 33377 . . . 
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32118 OCKHAMG-CDS 

32538 OCKHAMG-CDS 

330 8 4 OCKHAMG-CDS 

33247 OCKHAMG-CDS 



34803 OCKHAMG-CDS 

35223 OCKHAMG-CDS 

357 69 OCKHAMG-CDS 

35932 OCKHAMG-CDS 



32313 OCKHAMG-CDS 

32733 OCKHAMG-CDS 

3327 9 OCKHAMG-CDS 

334 42 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 133 

- Ceres seq_id 2033706 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 134 

- Ceres seq_id 2033707 

- Location of start within SEQ ID NO 133: at 1 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 13 

- Ceres seq_id 2033708 

- Location of start within 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 13 

- Ceres seq_id 2033709 

- Location of start within 

(Ba) Polypeptide Activities : 

dicot 

specific gene. 



SEQ ID NO 133: at 34 nt. 

SEQ ID NO 133: at 160 nt . 
Arabidopsis specific gene, 
specific gene, plant 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4512646 
Predicted Exons: 

INTR 43130 . . . 43387 OCKHAMG-CDNA 

INTR 43479 ... 43638 OCKHAMG-CDNA 



INIT 43224 ... 43387 
TERM 43479 . . . 43494 



OCKHAMG-CDS 
OCKHAMG-CDS 
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(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 137 

- Ceres seq__id 2043118 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 138 

- Ceres seq_id 2043119 

- Location of start within SEQ ID NO 137: at 65 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 138: at 22 aa. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 139 

- Ceres seq_id 2043120 

- Location of start within SEQ ID NO 137: at 1 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 140 

- Ceres seq_id 2043121 

- Location of start within SEQ ID NO 137: at 95 nt . 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 

specific gene, plant 



dicot 
specific gene 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 3985934 
Predicted Exons : 
INTR 40094 .. 
INTR 40861 . . 



INIT 
TERM 



40233 
40861 



4 0516 OCKHAMG-CDNA 

412 7 5 OCKHAMG-CDNA 

4 0516 OCKHAMG-CDS 

41077 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 141 

- Ceres seq__id 2047214 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 142 

- Ceres seq_id 2047215 

- Location of start within SEQ ID NO 141: at 140 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 143 

- Ceres seq_id 2047216 

- Location of start within SEQ ID NO 141: at 224 nt . 



(B) Polypeptide Sequence 
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- Pat. Appln. SEQ ID NO 144 

- Ceres seq_id 2047217 

- Location of start within SEQ ID NO 141: at 153 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
NO 14 4 : at 18 aa. 



(Ba) Polypeptide Activities: 



dicot 



specific gene. 



Arabidopsis specific gene 
specific gene, plant 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 3985958 
Predicted Exons: 
INTR 8598 
INTR 8960 
INTR 9202 
INTR 9628 



8 69 6 OCKHAMG - C DNA 
90 35 OCKHAMG-CDNA 

9 513 OCKHAMG-C DNA 
9 6 69 OCKHAMG-CDNA 



INIT 
INTR 
TERM 



8997 
9202 
9628 



9035 OCKHAMG-C DS 

9513 OCKHAMG -CDS 

9822 OCKHAMG-C DS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 145 

- Ceres seq__id 2047438 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 146 

- Ceres seq_id 2047439 

- Location of start within SEQ ID NO 145: at 137 nt 



Maximum Length Sequence; 

Public Genomic DNA: 
gi No: 4757403 
Predicted Exons: 
INTR 35281 

SINGLE 35235 . , 



34 7 81 OCKHAMG - C DNA 
35005 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 147 

- Ceres seq_id 2049056 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 148 

- Ceres seq_id 2049057 

- Location of start within SEQ ID NO 147: at 47 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 148 : at 22 aa . 
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(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 149 

- Ceres seq_id 2049058 

- Location of start within SEQ ID NO 147: at 59 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 14 9: at 18 aa. 



10 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4538972 
Predicted Exons : 



15 



20 



INTR 


32316 


32561 


OCKHAMG- 


•CDNA 


INTR 


32643 


. 32715 


OCKHAMG- 


-CDNA 


INTR 


33489 


. 33697 


OCKHAMG- 


•CDNA 


INTR 


33789 .. 


. 33935 


OCKHAMG- 


•CDNA 


INTR 


34034 


. 34365 


OCKHAMG- 


■CDNA 


INIT 


32328 . . 


32561 


OCKHAMG- 


CDS 


INTR 


32643 .. 


32715 


OCKHAMG- 


CDS 


INTR 


33489 .. 


33697 


OCKHAMG- 


CDS 


INTR 


33789 .. 


. 33935 


OCKHAMG- 


CDS 


TERM 


34034 


. 34159 


OCKHAMG- 


CDS 



25 



30 



35 



40 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 150 

- Ceres seq_id 2050386 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 151 

- Ceres seq_id 2050387 

- Location of start within SEQ ID NO 150: at 1 nt. 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 152 

- Ceres seq_id 2050388 

- Location of start within SEQ ID NO 150: at 13 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 153 

- Ceres seq_id 2050389 

- Location of start within SEQ ID NO 150: at 151 nt . 



45 



(Ba) Polypeptide Activities: Arabidopsis specific gene, 

specific gene, plant 



dicot 
specific gene 



50 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4878038 
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Predicted Exons: 
INTR 22640 
INTR 22412- 
INTR 22122 



INTR 
INTR 
INTR 



22674 
22412 
22122 



INIT 22587 
INTR 22412 
TERM 22122 
gi No: 6143825 
Predicted Exons : 
INTR 22639 
INTR 22411 
INTR 22121 



INTR 
INTR 
INTR 

INIT 
INTR 
TERM 



22673 
22411 
22121 

22586 
22411 
22121 
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22509 
22340 
21728 

22509 
22340 
21766 

22509 
22340 
21852 



22508 
22339 
21727 

22508 
22339 
21765 

22508 
22339 
21851 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 

OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



OCKHAMG-CDNA 
OCKHAMG-CDNA 
OCKHAMG-CDNA 



OCKHAMG- 
OCKHAMG- 
OCKHAMG- 



CDNA 
CDNA 
CDNA 



OCKHAMG-CDS 
OCKHAMG-CDS 
OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 154 

- Ceres seq_id 2053353 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 155 

- Ceres seq_id 2053354 

- Location of start within SEQ ID NO 154: at 88 nt . 



(Ba) Polypeptide Activities 



dicot 

specific gene. 



Arabidopsis specific gene, 
specific gene, plant 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4115352 
Predicted Exons: 



INIT 
INTR 
INTR 
INTR 
TERM 



29289 
27681 
27312 
26979 
26654 



29287 OCKHAMG-CDS 

27485 OCKHAMG-CDS 

27193 OCKHAMG-CDS 

2 6916 OCKHAMG-CDS 

2 664 6 OCKHAMG-CDS 
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(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 156 

- Ceres seq_id 2055693 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 157 

- Ceres seq_id 2055694 

- Location of start within SEQ ID NO 156: at 1 nt . 

'(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 158 

- Ceres seq_id 2055695 

- Location of start within SEQ ID NO 156: at 46 nt . 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 159 

- Ceres seq__id 2055696 

- Location of start within SEQ ID NO 156: at 2 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 159: at 28 aa . 



gene, dicot 
specific gene 



(Ba) Polypeptide Activities: Arabidopsis specific 

specific gene, plant 



Maximum Length Sequence: 
Public Genomic DNA: 
gi No: 4406790 
Predicted Exons : 
INTR 81640 . . 

SINGLE 81603 . . 



8124 7 O C KHAMG- C DNA 
813 67 OCKHAMG-CDS 



(Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 160 

- Ceres seq_id 2056405 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 161 

- Ceres seq_id 2056406 

- Location of start within SEQ ID NO 160: at 38 nt, 

(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 162 

- Ceres seq_id 2056407 

- Location of start within SEQ ID NO 160: at 56 nt . 



(B) Polypeptide Sequence 

- Pat. Appln. SEQ ID NO 163 
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- Ceres seq_id 2056408 

- Location of start within SEQ ID NO 160: at 177 nt . 

- Location of Signal Peptide Cleavage Site within SEQ 
ID NO 163: at 14 aa. 

(Ba) Polypeptide Activities: Arabidopsis specific gene, 

dicot 



specific gene, plant 



specific gene. 



Maximum Length Sequence: 

related to: 
Clone IDs: 

233233 

15 (Ac) cDNA Polynucleotide Sequence 

- Pat. Appln. SEQ ID NO 164 

- Ceres seq_id 2065747 

(B) Polypeptide Sequence 
20 - Pat. Appln. SEQ ID NO 165 

- Ceres seq_id 2065748 

- Location of start within SEQ ID NO 164: at 114 nt . 

(B) Polypeptide Sequence 
25 - Pat. Appln. SEQ ID NO 166 

- Ceres seq_id 2065749 

- Location of start within SEQ ID NO 164: at 279 nt . 
(Ba) Polypeptide Activities: Arabidopsis specific gene, 

30 dicot 

specific gene, plant 

specific gene. 
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SEQ TABLE 1 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..601 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007546 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



gacgaacagg ccacgcgtga ctacatacat aatttggttt ctcttttttt ttttgttctt 60 

cttgttcctc tttggttttt gacattggtt ctggagaaga tgagttcggt agaaccagac 120 

atggaagatc tgttccagga gaagaagcgt gtcaggaatc ctctcgttcc tctcggtgca 180 

cttatgaccg cgggagtgct cacggctggg ttgattagtt tcagaagagg caattctcag 240 

ttgggtcagg ttttgatgag agctagggtg gtcgtccagg gtgctactgt cgctttaatg 300 

gttggaaccg gttattacta cggtgataat ccgtggaaga agtgagctcc aactacttgt 360 

tcgttttcac ataattgaga tgacactcgg gaaatctgcg ggcaaaggtg tgtgtgctct 420 

cgctctgagt gtagaaatat gcggtcttgc taggatttag taagggtaca ttacatgatt 480 

tgaataattt gaagaacgaa caagtatcgt tgtatcttga tcaattgatt agattgattt 540 

aaggaaattc tccgaaaacc atctttactc tatctgtcaa aatctcatta tctttctgcc 600 



g 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 114 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007547 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Asp 


Glu 


Gin 


Ala 


Thr 


Arg 


Asp 


Tyr 


He 


His 


Asn 


Leu 


Val 


Ser 


Leu 


Phe 


1 








5 










10 










15 




Phe 


Phe 


Val 


Leu 


Leu 


Val 


Pro 


Leu 


Trp 


Phe 


Leu 


Thr 


Leu 


Val 


Leu 


Glu 








20 










25 










30 






Lys 


Met 


Ser 


Ser 


Val 


Glu 


Pro 


Asp 


Met 


Glu 


Asp 


Leu 


Phe 


Gin 


Glu 


Lys 






35 










40 










45 








Lys 


Arg 


Val 


Arg 


Asn 


Pro 


Leu 


Val 


Pro 


Leu 


Gly 


Ala 


Leu 


Met 


Thr 


Ala 




50 










55 










60 










Gly 


Val 


Leu 


Thr 


Ala 


Gly 


Leu 


He 


Ser 


Phe 


Arg 


Arg 


Gly 


Asn 


Ser 


Gin 


65 










70 










75 










80 


Leu 


Gly 


Gin 


Val 


Leu 


Met 


Arg 


Ala 


Arg 


Val 


Val 


Val 


Gin 


Gly 


Ala 


Thr 










85 










90 










95 




Val 


Ala 


Leu 


Met 


Val 


Gly 


Thr 


Gly 


Tyr 


Tyr 


Tyr 


Gly Asp 


Asn 


Pro 


Trp 








100 










105 










110 







Lys Lys 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 81 
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(D) OTHER INFORMATION: / Ceres Seq. ID 1007548 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



ixie l 




Ser 


val 


t^_l_U 


Pro 


Asp 


Met 




Asp 


Leu 


tr ne 


pi n 
tjj_n 


pi n 


Lys 


Lys 


1 








5 










10 










15 




Arg 


Val 


Arg 


Asn 


Pro 


Leu 


Val 


Pro 


Leu 


Gly 


Ala 


Leu 


Met 


Thr 


Ala 


Gly 








20 










25 










30 






Val 


Leu 


Thr 


Ala 


Gly 


Leu 


He 


Ser 


Phe 


Arg 


Arg 


Gly 


Asn 


Ser 


Gin 


Leu 






35 










40 










45 








Gly 


Gin 


Val 


Leu 


Met 


Arg 


Ala 


Arg 


Val 


Val 


Val 


Gin 


Gly 


Ala 


Thr 


Val 




50 










55 










60 










Ala 


Leu 


Met 


Val 


Gly 


Thr 


Gly 


Tyr 


Tyr 


Tyr 


Gly 


Asp 


Asn 


Pro 


Trp 


Lys 


65 










70 










75 










80 



Lys 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 



(A) NAME/KEY: peptide 

(B) LOCATION: 1. .74 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007549 





(xi) 


SEQUENCE DESCRIPTION: : 


seq : 


ID NO:4: 












Met 


Glu 


Asp Leu 


Phe 


Gin 


Glu Lys 


Lys 


Arg Val 


Arg 


Asn 


Pro 


Leu 


Val 


1 






5 








10 








15 




Pro 


Leu 


Gly Ala 


Leu 


Met 


Thr Ala 


Gly 


Val Leu 


Thr 


Ala 


Gly 


Leu 


He 






20 








25 








30 






Ser 


Phe 


Arg Arg 


Gly 


Asn 


Ser Gin 


Leu 


Gly Gin 


Val 


Leu 


Met 


Arg 


Ala 






35 






40 








45 








Arg 


Val 


Val Val 


Gin 


Gly 


Ala Thr 


Val 


Ala Leu 


Met 


Val 


Gly 


Thr 


Gly 




50 








55 






60 










Tyr 


Tyr 


Tyr Gly 


Asp 


Asn 


Pro Trp 


Lys 


Lys 












65 








70 


















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:5 

















(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 97 0 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007583 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



aacgccgctc atttttaggg tttcagatat aggcagattc tcacatttgc aaacatgacg 60 

actcagatca gcaagaagag aaagtttgta gcggacggtg tgttctacgc tgaattgaat 120 

gaggttctta caagagagct agcagaggat ggttactctg gtgttgaggt tagggttact 180 

cctatgagga ctgagattat catcagagct acacgtactc agaatgttct cggtgagaag 240 

gggaggagaa ttagggaatt gacttccctt gttcagaaga gattcaagtt tccagttgac 300 

agtgttgagc tttatgccga gaaggttaac aacagaggtc tctgtgccat cgctcaggct 360 

gagtctttac gttacaagct tcttggtggt ctcgctgttc gtagggcttg ctatggtgtg 420 

ttgaggtttg ttatggagag tggagctaag ggatgcgagg ttatcgtgag tggaaagctt 480 

cgtgctgcca gagccaagtc tatgaagttc aaagatggct acatggtgtc atctggtcaa 540 

ccaactaagg aatacataga ctctgcagtg agacatgttt tgcttagaca aggtgtgttg 600 

ggaatcaagg tgaaggttat gcttgattgg gaccctaagg gcatatcagg accaaagaca 660 

ccattgcctg atgttgtgat cattcattct cctaaagaag aagaggccat ctacgcacct 720 

gctcaggttg ctgccccggc tgctctcgta gcagatgcac cactcacagc cgtagattac 780 

cctgcgatga tcccagtcgc ctaaaagaat cctttgtact agtcgatgtg attttgtttt 840 
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caactgttct tgctcttttg acagattata gcttctcgta tcttcgaatc agacacgttt 900 
cctctgttct tattcgaaaa ttatctcaag tttccattgt tgaaaaagcc ctttttagtt 960 
tcataaatcg 

(2) INFORMATION FOR SEQ ID NO: 6: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
10 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 249 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007584 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Thr Thr Gin lie Ser Lys Lys Arg Lys Phe Val Ala Asp Gly Val 
1 5 10 15 

Phe Tyr Ala Glu Leu Asn Glu Val Leu Thr Arg Glu Leu Ala Glu Asp 
20 25 30 

2 0 Gly Tyr Ser Gly Val Glu Val Arg Val Thr Pro Met Arg Thr Glu lie 
35 40 45 

lie lie Arg Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg 

50 55 60 

Arg lie Arg Glu Leu Thr Ser Leu Val Gin Lys Arg Phe Lys Phe Pro 
2 5 65 70 75 " 80 

Val Asp Ser Val Glu Leu Tyr Ala Glu Lys Val Asn Asn Arg Gly Leu 

85 90 95 

Cys Ala lie Ala Gin Ala Glu Ser Leu Arg Tyr Lys Leu Leu Gly Gly 
100 105 ~ 110 

30 Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu Arg Phe Val Met Glu 
115 " 120 " 125 

Ser Gly Ala Lys Gly Cys Glu Val lie Val Ser Gly Lys Leu Arg Ala 

130 ~ 135 140 

Ala Arg Ala Lys Ser Met Lys Phe Lys Asp Gly Tyr Met Val Ser Ser 
35 145 150 " 155 160 

Gly Gin Pro Thr Lys Glu Tyr lie Asp Ser Ala Val Arg His Val Leu 

165 ~ " 170 175 

Leu Arg Gin Gly Val Leu Gly lie Lys Val Lys Val Met Leu Asp Trp 
180 "* 185 190 

40 Asp Pro Lys Gly lie Ser Gly Pro Lys Thr Pro Leu Pro Asp Val Val 
195 200 205 

lie lie His Ser Pro Lys Glu Glu Glu Ala lie Tyr Ala Pro Ala Gin 

210 215 220 

Val Ala Ala Pro Ala Ala Leu Val Ala Asp Ala Pro Leu Thr Ala Val 
45 225 230 235 240 

Asp Tyr Pro Ala Met lie Pro Val Ala 
245 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

55 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..206 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007585 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
60 Met Arg Thr Glu lie lie lie Arg Ala Thr Arg Thr Gin Asn Val Leu 
15 10 15 

Gly Glu Lys Gly Arg Arg lie Arg Glu Leu Thr Ser Leu Val Gin Lys 
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20 25 30 

Arg Phe Lys Phe Pro Val Asp Ser Val Glu Leu Tyr Ala Glu Lys Val 

35 4 0 4 5 

Asn Asn Arg Gly Leu Cys Ala lie Ala Gin Ala Glu Ser Leu Arg Tyr 
5 50 55 60 

Lys Leu Leu Gly Gly Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu 
65 7 0 7 5 8 0 

Arg Phe Val Met Glu Ser Gly Ala Lys Gly Cys Glu Val lie Val Ser 
85 90 95 

10 Gly Lys Leu Arg Ala Ala Arg Ala Lys Ser Met Lys Phe Lys Asp Gly 

100 105 110 

Tyr Met Val Ser Ser Gly Gin Pro Thr Lys Glu Tyr lie Asp Ser Ala 
115 120 125 





Val 


Arg 


His 


Val 


Leu 


Leu 


Arg 


Gin 


Gly 


Val 


Leu 


Gly 


He 


Lys 


Val 


Lys 


15 




130 










135 










140 












Val 


Met 


Leu 


Asp 


Trp 


Asp 


Pro 


Lys 


Gly 


He 


Ser 


Gly 


Pro 


Lys 


Thr 


Pro 




145 










150 










155 










160 




Leu 


Pro 


Asp 


Val 


Val 
165 


He 


He 


His 


Ser 


Pro 
170 


Lys 


Glu 


Glu 


Glu 


Ala 
175 


He 


20 


Tyr 


Ala 


Pro 


Ala 
180 


Gin 


Val 


Ala 


Ala 


Pro 
185 


Ala 


Ala 


Leu 


Val 


Ala 
190 


Asp 


Ala 




Pro 


Leu 


Thr 
195 


Ala 


Val 


Asp 


Tyr 


Pro 
200 


Ala 


Met 


He 


Pro 


Val 
205 


Ala 







(2) INFORMATION FOR SEQ ID NO: 8: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .768 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008148 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

aaaaaaaccc tagttttttg ctctctcgaa gttccgtgcg actctctgct cgagcaagat 60 
ctcctaccga tacaatgatt atctcagagg ctaaccgcaa agaaatctgc aagtacctct 120 
tcaaagaagg agtttgcttt gcgaagaagg atttcaatct tgctaagcat ccgttgatcg 180 
atgtcccgaa cctacaagtg attaagctta tgcagagttt caaatccaag gagtatgtta 240 

4 0 gagagacatt tgcctggatg cattactatt ggtttttgac caatgaaggg atcgagttct 300 

tgagaactta tcttaatctt ccatctgatg ttgttcctgc tactttgaag aagtcagcta 360 
agcctggtgg tcgtccattt ggtggcccac ctggtgatcg ctcaagagga cctcgccatg 420 
aaggaggaga ccgtcccagg tttggtgacc gtgatgggta ccgtgcaggt cctcgagctg 480 
gtggtgagtt tggaggtgaa aagggtggag ctcctgcaga ttaccagcca tctttccaag 540 

4 5 gaagtgggcg tggttttggc cgtggtgctg gtggctacag cgcagctgca ccatctggtt 600 
caggtttgcc ttgaaaaaga aatgtcttta ggtgacagta agaccatgga ggagttttca 660 
gctttaaatt ttgcttttgt aattcagatt ccggaatcct tcataatctc tatctgagtt 720 
tagttttgtt gttgaatcaa acatccgatt taaagttatg ttcattcc 
(2) INFORMATION FOR SEQ ID NO : 9 : 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
55 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..203 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008149 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Lys Asn Pro Ser Phe Leu Leu Ser Arg Ser Ser Val Arg Leu Ser Ala 
15 10 15 
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Arg 


Ala 


Arg 


Ser 


Pro 


Thr 


Asp 


Thr 


Met 


-r "1 _ 

lie 


lie 


Ser 


GlU 


7\ 1 ^ 

Ala 


Asn 


Arg 








9 n 
z u 










9 R 










^ n 






Lys 


GlU 


lie 


Cys 


Lys 


Tyr 


Leu 


Fne 


Lys 


gi u 


Giy 


\7 = 1 

vai 


Cys 


Fne 


Ala 


Lys 






"5 R 










4 U 










/l R 








Lys 


Asp 


fne 


Asn 


Leu 


Ala 


Lys 


his 


Pro 


Leu 


lie 


Asp 


XT-, 1 

vai 


Pro 


Asn 


Leu 




r n 










R R 










fin 










Gin 


W ^ 1 

v ai 


i le 


Lys 


Leu 


Met 


Gin 


Ser 


pne 


Lys 


Ser 


Lys 


("11-1 

GlU 


Tyr 


vai 


Arg 


£ R 




















7 R 










fl n 

O u 


GlU 


rp l_ -w- 

i nr 


Fne 


Ala 


Trp 


Met 


his 


Tyr 


Tyr 


Trp 


D V-W /~V 

pne 


Leu 


i nr 


Asn 


Glu 


Gly 










Q R 










q n 










95 




He 


Glu 


Pne 


Leu 


Arg 


Thr 


Tyr 


Leu 


Asn 


Leu 


Pro 


Ser 


Asp 


Vai 


Vai 


Pro 








inn 

1UU 










i n r 

IUj 










110 






Ala 


Thr 


Leu 


Lys 


Lys 


Ser 


Ala 


Lys 


Pro 


Gly 


Gly 


Arg 


Pro 


Phe 


Gly 


Gly 






11 J 










ion 

1ZU 










1 O R 
1ZD 








Pro 


Pro 


Gly 


Asp 


Arg 


Ser 


Arg 


Gly 


Pro 


Arg 


His 


Glu 


Gly 


Gly Asp 


Arg 




1 JU 










1 *3 R 

i j j 










i a n 

1 H u 










Pro 


Arg 


Phe 


Gly 


Asp 


Arg 


Asp 


Gly 


Tyr 


Arg 


Ala 


Gly 


Pro 


Arg 


Ala 


Gly 


145 










150 










155 










160 


Gly 


Glu 


Phe 


Gly 


Gly 


Glu 


Lys 


Gly 


Gly 


Ala 


Pro 


Ala 


Asp 


Tyr 


Gin 


Pro 










165 










170 










175 




Ser 


Phe 


Gin 


Gly 


Ser 


Gly 


Arg 


Gly 


Phe 


Gly 


Arg 


Gly 


Ala 


Gly 


Gly 


Tyr 








180 










185 










190 






Ser 


Ala 


Ala 


Ala 


Pro 


Ser 


Gly 


Ser 


Gly 


Leu 


Pro 













195 200 



2 5 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..17 9 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 1008150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 





Met 


He 


He 


Ser 


Glu 


Ala 


Asn 


Arg 


Lys 


Glu 


He 


Cys 


Lys 


Tyr 


Leu 


Phe 




1 








5 










10 










15 






Lys 


Glu 


Gly 


Vai 


Cys 


Phe 


Ala 


Lys 


Lys 


Asp 


Phe 


Asn 


Leu 


Ala 


Lys 


His 


40 








20 










25 










30 








Pro 


Leu 


He 
35 


Asp 


Vai 


Pro 


Asn 


Leu 
40 


Gin 


Vai 


He 


Lys 


Leu 
45 


Met 


Gin 


Ser 




Phe 


Lys 
50 


Ser 


Lys 


Glu 


Tyr 


Vai 
55 


Arg 


Glu 


Thr 


Phe 


Ala 
60 


Trp 


Met 


His 


Tyr 


45 


Tyr 
65 


Trp 


Phe 


Leu 


Thr 


Asn 
70 


Glu 


Gly 


He 


Glu 


Phe 
75 


Leu 


Arg 


Thr 


Tyr 


Leu 
80 




Asn 


Leu 


Pro 


Ser 


Asp 
85 


Vai 


Vai 


Pro 


Ala 


Thr 
90 


Leu 


Lys 


Lys 


Ser 


Ala 
95 


Lys 




Pro 


Gly 


Gly 


Arg 


Pro 


Phe 


Gly 


Gly 


Pro 


Pro 


Gly 


Asp 


Arg 


Ser 


Arg 


Gly 


50 








100 










105 










110 








Pro 


Arg 


His 
115 


Glu 


Gly 


Gly 


Asp 


Arg 
120 


Pro 


Arg 


Phe 


Gly 


Asp 
125 


Arg 


Asp 


Gly 




Tyr 


Arg 
130 


Ala 


Gly 


Pro 


Arg 


Ala 
135 


Gly 


Gly 


Glu 


Phe 


Gly 
140 


Gly 


Glu 


Lys 


Gly 


55 


Gly 
145 


Ala 


Pro 


Ala 


Asp 


Tyr 
150 


Gin 


Pro 


Ser 


Phe 


Gin 
155 


Gly 


Ser 


Gly 


Arg 


Gly 
160 




Phe 


Gly 


Arg 


Gly 


Ala 
165 


Gly 


Gly 


Tyr 


Ser 


Ala 
170 


Ala 


Ala 


Pro 


Ser 


Gly 
175 


Ser 



Gly Leu Pro 

60 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 134 

(D) OTHER INFORMATION: 7 Ceres Seq. ID 1008151 





(XI ) 


SEQUENCE DESCRIPTION: SEQ ID NO: 11: 












Met 


Gin 


Ser 


rne 


Lys 


ber 


Lys 


blU 


Tyr 


val 


Arg 


Glu 


Thr 


Phe 


Ala 


Trp 


1 








c 
D 










1 u 










15 




Met 


His 


Tyr 


Tyr 

20 


Trp 


Phe 


Leu 


Thr 


Asn 
25 


Glu 


Gly 


i-Le 


CjIU 


Phe 
30 


Leu 


Arg 


Thr 


Tyr 


Leu 
35 


Asn 


Leu 


Pro 


Ser 


Asp 
40 


Val 


Val 


Pro 


Ala 


Thr 

45 


Leu 


Lys 


Lys 


Ser 


Ala 
50 


Lys 


Pro 


Gly 


Gly 


Arg 

55 


Pro 


Phe 


Gly 


Gly 


Pro 
60 


Pro 


Gly 


Asp 


Arg 


Ser 


Arg 


Gly 


Pro 


Arg 


His 


Glu 


Gly 


Gly 


Asp 


Arg 


Pro 


Arg 


Phe 


Gly 


Asp 


65 










70 










75 










80*" 


Arg 


Asp 


Gly 


Tyr 


Arg 
85 


Ala 


Gly 


Pro 


Arg 


Ala 
90 


Gly 


Gly 


Glu 


Phe 


Gly 
95 


Gly 


Glu 


Lys 


Gly 


Gly 
100 


Ala 


Pro 


Ala 


Asp 


Tyr 

105 


Gin 


Pro 


Ser 


Phe 


Gin 
110 


Gly 


Ser 


Gly 


Arg 


Gly 
115 


Phe 


Gly 


Arg 


Gly 


Ala 
120 


Gly 


Gly 


Tyr 


Ser 


Ala 
125 


Ala 


Ala 


Pro 


Ser 


Gly 
130 


Ser 


Gly 


Leu 


Pro 






















(2) 


INFORMATION 


FOR 


SEQ 


ID ' 


NO: 12: 

















10 



15 



20 



25 



3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..9 95 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008334 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

atatctaaaa gactcaaaaa cctatcgtca tttatcttca acaggcggaa gaacggcgat 60 
ggccgccatt acagctctca ctctccgctc tcctgttttt cttcttcctc catcgtccgt 120 
cacaagccct agattccatg gcttcaccaa tcaaccacca ccagctcgtc tcttctttcc 180 
tcttaacccc ttcccttctc tatcaatcca aaaccctaaa tccatccgaa tttccgcatc 240 

4 5 cgcttcgccg ataacaacac caatcctcca gacggagaag tcaacagctc ggtcatcgac 300 
actcaccggg tccactcggt ctctcgctac tctcgcggct ttagcaatcg ctgtaaccag 360 
agtcctagct cagaaactct ctctcgcaat ccagacttca agtcccgtaa tcgccgacgg 420 
gttacgattc tctctcagta ccgccggacc tgtcttcttc gcgtctctcc gggatcgtcc 480 
tccgggatac ttgaacacgc cgctgacggt tgttgcggtg gggataaaga agtggctaga 540 

50 catttacagt ggggtattga tggttagggt tttgctgagt tggttcccta atatcccttg 600 
ggaaagacag cctttgtctg ccattagaga tctctgtgat ccttacttga atctcttcag 660 
aaacatcatt cctcctatct tcgatacgct tgatgttagt ccattgcttg ctttcgcggt 720 
tcttggtaca cttggatcga ttgttcatgg cagcactggg tagaaattgg aagacttgag 780 
ctaatggttt tgttgaaacg ttcaattggg aaatttctgc aacttgtttc ccaagattag 840 

55 ggaaaaatca gaatttgatg ctgtgttgga tgctcgatga gagttgtgag ctttatcgtt 900 
tttgttgttc tctttttgac tttgattctg aataatgaga ttttgggttg ttgtttatct 960 
cctaattaca tgttcagact acagtttatt gatgg 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

60 (A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..253 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008335 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Tyr Leu Lys Asp Ser Lys Thr Tyr Arg His Leu Ser Ser Thr Gly Gly 
15 10 15 

Arg Thr Ala Met Ala Ala lie Thr Ala Leu Thr Leu Arg Ser Pro Val 

20 25 30 

Phe Leu Leu Pro Pro Ser Ser Val Thr Ser Pro Arg Phe His Gly Phe 

35 40 45 

Thr Asn Gin Pro Pro Pro Ala Arg Leu Phe Phe Pro Leu Asn Pro Phe 

50 55 60 

Pro Ser Leu Ser lie Gin Asn Pro Lys Ser lie Arg lie Ser Ala Ser 
65 70 " 75 80 

Ala Ser Pro lie Thr Thr Pro lie Leu Gin Thr Glu Lys Ser Thr Ala 

85 90 95 

Arg Ser Ser Thr Leu Thr Gly Ser Thr Arg Ser Leu Ala Thr Leu Ala 

100 105 110 

Ala Leu Ala lie Ala Val Thr Arg Val Leu Ala Gin Lys Leu Ser Leu 

115 120 125 

Ala lie Gin Thr Ser Ser Pro Val lie Ala Asp Gly Leu Arg Phe Ser 

130 135 140 

Leu Ser Thr Ala Gly Pro Val Phe Phe Ala Ser Leu Arg Asp Arg Pro 
145 150 155 160 

Pro Gly Tyr Leu Asn Thr Pro Leu Thr Val Val Ala Val Gly lie Lys 

165 170 175 

Lys Trp Leu Asp lie Tyr Ser Gly Val Leu Met Val Arg Val Leu Leu 

180 185 190 

Ser Trp Phe Pro Asn lie Pro Trp Glu Arg Gin Pro Leu Ser Ala lie 

195 200 205 

Arg Asp Leu Cys Asp Pro Tyr Leu Asn Leu Phe Arg Asn lie lie Pro 

210 215 220 

Pro lie Phe Asp Thr Leu Asp Val Ser Pro Leu Leu Ala Phe Ala Val 
225 * 230 * 235 240 

Leu Gly Thr Leu Gly Ser lie Val His Gly Ser Thr Gly 

245 250 
(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..234 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008336 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
Met Ala Ala lie Thr Ala Leu Thr Leu Arg Ser Pro Val Phe Leu Leu 
15 10 15 

Pro Pro Ser Ser Val Thr Ser Pro Arg Phe His Gly Phe Thr Asn Gin 

20 25 30 

Pro Pro Pro Ala Arg Leu Phe Phe Pro Leu Asn Pro Phe Pro Ser Leu 

35 40 45 

Ser lie Gin Asn Pro Lys Ser lie Arg lie Ser Ala Ser Ala Ser Pro 

50 55 60 

lie Thr Thr Pro lie Leu Gin Thr Glu Lys Ser Thr Ala Arg Ser Ser 
65 70 75 80 

Thr Leu Thr Gly Ser Thr Arg Ser Leu Ala Thr Leu Ala Ala Leu Ala 
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o c 
o O 










y u 










y o 




Tin 

i le 


A±a 


vai 


i nr 


Arg 


val 


Leu 


Aia 


bin 


Lys 


Leu 


Ser 


Leu 


Aia 


lie 


bin 








i on 










1 U O 










1 1 U 






rp 1_ -»~ 

i nr 


Ser 


Ser 


Pro 


v ai 


i le 


ai a 


Asp 


oiy 


Leu 


Arg 


Fne 


ber 


Leu 


ber 


1 nr 






lit; 










ion 
i z u 










1 Z O 








Aia 


Lily 


Pro 


XT- "1 

v ai 


fne 


rne 


Ala 


ber 


Leu 


Arg 


Asp 


Arg 


Pro 


Pro 




Tyr 




ion 










1 jj 










1 4 u 










Leu 


Asn 


rp 1_ „ 

i nr 


Pro 


Leu 


i nr 


val 


val 


Ala 


T T — "J 

val 


Gil y 


lie 


Lys 


Lys 


Trp 


Le u 


1 A CI 

14 0 










1 C A 










loo 










loU 


ASp 


lie 


Tyr 


Ser 


Gly 


Val 


Leu 


Met 


Val 


Arg 


Val 


Leu 


Leu 


Ser 


Trp 


-n t_ 
Fne 










iOJ 










1 / u 














Pro 


Asn 


He 


Pro 


Trp 


Glu 


Arg 


Gin 


Pro 


Leu 


Ser 


Ala 


He 


Arg 


Asp 


Leu 








180 










185 










190 






Cys 


Asp 


Pro 


Tyr 


Leu 


Asn 


Leu 


Phe 


Arg 


Asn 


He 


He 


Pro 


Pro 


He 


Phe 






195 










200 










205 








Asp 


Thr 


Leu 


Asp 


Val 


Ser 


Pro 


Leu 


Leu 


Ala 


Phe 


Ala 


Val 


Leu 


Gly 


Thr 




210 










215 










220 










Leu 


Gly 


Ser 


He 


Val 


His 


Gly 


Ser 


Thr 


Gly 















225 230 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..520 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008701 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



acacataacc ctcttggaaa gagtctcaac acttgcagag aaaaagaaga aggaagatcc 60 

cggaaaatgg caacggcgat tgtacgttca gctctttccc gagcagcgac tcgcgcagct 120 

ccgaagacat ccgtcgctcc taagcgaaac ttttcctctt ccgccggcca tgacgatgct 180 

tatgaagctg cgaagtggga gaagataact tatctgggta ttgctagttg cactgctcta 24 0 

gctgtctatg ttttatccaa gggccatcat cccggcgaag accctcctgc ctatccgcat 300 

atgcacatcc gcaacaagga gtttccttgg ggtccggatg gtctgtttga ggtgaagcac 360 

aacaaagagc actgagtctt gcgtggtcat aataacgtct tcttggttta tttgaaaggc 420 

taaaatgttt taccgtattt gttctcaccg tttgtcaacg attttctact ccaatctctt 480 



ttcttttgtt gggaaataaa agttaatact ttgcttggtc 
(2) INFORMATION FOR SEQ ID NO : 1 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 124 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008702 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Thr 


His 


Asn 


Pro 


Leu 


Gly 


Lys 


Ser 


Leu 


Asn 


Thr 


Cys 


Arg 


Glu 


Lys 


Glu 


1 








5 










10 










15 




Glu 


Gly 


Arg 


Ser 


Arg 


Lys 


Met 


Ala 


Thr 


Ala 


He 


Val 


Arg 


Ser 


Ala 


Leu 








20 










25 










30 






Ser 


Arg 


Ala 


Ala 


Thr 


Arg 


Ala 


Ala 


Pro 


Lys 


Thr 


Ser 


Val 


Ala 


Pro 


Lys 






35 










40 










45 








Arg 


Asn 


Phe 


Ser 


Ser 


Ser 


Ala 


Gly 


His 


Asp 


Asp 


Ala 


Tyr 


Glu 


Ala 


Ala 




50 










55 










60 










Lys 


Trp 


Glu 


Lys 


He 


Thr 


Tyr 


Leu 


Gly 


He 


Ala 


Ser 


Cys 


Thr 


Ala 


Leu 


65 










70 










75 










80 
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Ala Val Tyr Val Leu Ser Lys Gly His His Pro Gly Glu Asp Pro Pro 

85 - 90 95 

Ala Tyr Pro His Met His lie Arg' Asn Lys Glu Phe Pro Trp Gly Pro 

100. - 105 110 

Asp Gly Leu Phe Glu Val Lys His Asn Lys Glu His 

115 ' 120 

(2) INFORMATION FOR SEQ LD NO.: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
<B) LOCATION: 1..102 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008703 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 17: 












Met 


Ala 


Thr 


Ala 


He 


Val 


Arg 


Ser 


Ala 


Leu 


Ser 


Arg 


Ala 


Ala 


Thr 


Arg 


1 








5 










10 










15 




Ala 


Ala 


Pro 


Lys 


Thr 


Ser 


Val 


Ala 


Pro 


Lys 


Arg 


Asn 


Phe 


Ser 


Ser 


Ser 








20 










25 










30 






Ala 


Gly 


His 


Asp 


Asp 


Ala 


Tyr 


Glu 


Ala 


Ala 


Lys 


Trp 


Glu 


Lys 


He 


Thr 






35 










40 










45 








Tyr 


Leu 


Gly 


He 


Ala 


Ser 


Cys 


Thr 


Ala 


Leu 


Ala 


Val 


Tyr 


Val 


Leu 


Ser 




50 










55 










60 










Lys 


Gly 


His 


His 


Pro 


Gly 


Glu 


Asp 


Pro 


Pro 


Ala 


Tyr 


Pro 


His 


Met 


His 


65 










70 










75 










80 


He 


Arg 


Asn 


Lys 


Glu 


Phe 


Pro 


Trp 


Gly 


Pro 


Asp 


Gly 


Leu 


Phe 


Glu 


Val 










85 










90 










95 




Lys 


His 


Asn 


Lys 


Glu 


His 























100 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..68 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008704 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
His He Thr Leu Leu Glu Arg Val Ser Thr Leu Ala Glu Lys Lys Lys 
15 10 15 

Lys Glu Asp Pro Gly Lys Trp Gin Arg Arg Leu Tyr Val Gin Leu Phe 

20 25 30 

Pro Glu Gin Arg Leu Ala Gin Leu Arg Arg His Pro Ser Leu Leu Ser 

35 4 0 4 5 

Glu Thr Phe Pro Leu Pro Pro Ala Met Thr Met Leu Met Lys Leu Arg 

50 55 60 

Ser Gly Arg Arg 
65 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
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(A) NAME/KEY: - 

(B) LOCATION: 1..662 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009003 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
5 cattttttct ttctctctct cctccctaag caaaactaaa acaagctatg gctggtatgc 60 
ttcccggagt tgagtgtgca aggaggcggc gcttccacgg tggtgctcct ccgattgaat 120 
cctcgaacac agcttctgtg gcggctgcgg cgggacacgt ctggacacgg cgaccatcgt 180 
tctctcttta cactaccaat catgagagcc accaggccca tgtctccttc tcggagagaa 240 
gtgttaggaa taaatcttat ggagaagaca acgatgagaa acttgacgga gcagccaaag 300 
10 aggcaaagca gaggcttaac aagcggctga gaatcccacc acgtacaagt tcaggcaaaa 360 
tggtaaagac aaagggaata aattggagca aggaaagggt aaacctctcg gggacttacc 420 
gaccgaggtg gtcgggttaa agaagagccg aggaaggttg atggaatggt tcaagcggcg 480 
agttagggaa caacaagatt gtgctatatg tctagaccgg ttcaagaagg gtgagacctt 540 
ggtacaccta ccatgtgccc ataagtttca ctccatatgc ttattgcctt ggctagacac 600 
15 taatgtttat tgcccatatt gtagaactga tatttggaat taaatgttat atttttgatg 660 
cc 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

25 (A) NAME /KEY : peptide 

(B) LOCATION: 1..130 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009004 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
Met Ala Gly Met Leu Pro Gly Val Glu Cys Ala Arg Arg Arg Arg Phe 
30 1 5 10 15 

His Gly Gly Ala Pro Pro lie Glu Ser Ser Asn Thr Ala Ser Val Ala 

20 25 30 

Ala Ala Ala Gly His Val Trp Thr Arg Arg Pro Ser Phe Ser Leu Tyr 
35 4 0 4 5 

35 Thr Thr Asn His Glu Ser His Gin Ala His Val Ser Phe Ser Glu Arg 
50 55 60 

Ser Val Arg Asn Lys Ser Tyr Gly Glu Asp Asn Asp Glu Lys Leu Asp 
65 70 75 80 

Gly Ala Ala Lys Glu Ala Lys Gin Arg Leu Asn Lys Arg Leu Arg lie 
4 0 85 90 95 

Pro Pro Arg Thr Ser Ser Gly Lys Met Val Lys Thr Lys Gly lie Asn 

100 105 110 

Trp Ser Lys Glu Arg Val Asn Leu Ser Gly Thr Tyr Arg Pro Arg Trp 
115 120 125 

45 Ser Gly 
130 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

55 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 127 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009005 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Met Leu Pro Gly Val Glu Cys Ala Arg Arg Arg Arg Phe His Gly Gly 
60 1 5 10 15 

Ala Pro Pro lie Glu Ser Ser Asn Thr Ala Ser Val Ala Ala Ala Ala 
20 25 30 
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Cjrly 


His 


val 


Trp 


Thr 


Arg 


Arg 


Pro 


Ser 


pne 


Ser 


Leu 


T yr 


Thr 
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Asn 
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HIS 
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C /-> 
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Ala 


HIS 


val 


Ser 


pne 


ber 


LjIU 


Arg 


Ser 


val 


Arg 














3 3 




















Asn 


Lys 


Ser 


Tyr 


(jiy 


Glu 


Asp 


Asn 


Asp 


blU 


Lys 


Leu 


Asp 


(jiy 


Ala 


Ala 


DO 










1 U 










/ 3 










o U 


Lys 


Glu 


Ala 


Lys 


Gin 


Arg 


Leu 


Asn 


Lys 


Arg 


Leu 


Arg 


He 


Pro 


Pro 


Arg 










85 










90 










95 




Thr 


Ser 


Ser 


Gly 


Lys 


Met 


Val 


Lys 


Thr 


Lys 


Gly 


He 


Asn 


Trp 


Ser 


Lys 








100 










105 










110 






Glu 


Arg 


Val 


Asn 


Leu 


Ser 


Gly 


Thr 


Tyr 


Arg 


Pro 


Arg 


Trp 


Ser 


Gly 








115 










120 










125 








(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO:22: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. .5 04 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009345 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



aagaaacaaa ggtgtcaaga agaaggatca acatattaat tgactaaaaa tgaatgcatg 6 

gatgataatc ttgttggtga tttgtgtcgc tgtggtggtg gagcaatcag aggctcgcaa 12 

aggtcgaaag tatttaaatc caggcgtgct tgaccggtgt cgtggtccta atcctccagc 18 

gggatgtcat cctcacaatt cccaccacaa acctcgcgtc cctgttcaca attatagtcg 24 

tggttgtagt agaattaccc ggtgcagacg agatgcctag gtttaacctc acatgacgac 30 

gacgtccctt tatattgatc cttcttctac atgagacttg caatgatatt atatattcat 36 

atatatacac attacgctgt atatgataat tccagttgat taatatataa atcttactga 42 

tttcaattca atacacaaga tcctgtaact gaaataattt tgtccatgtt ttgtgcttat 48 



aataatggta tgactcttat tatt 
(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..76 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009346 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



Met 


Asn 


Ala 


Trp 


Met 


He 


He 


Leu 


Leu 


Val 


He 


Cys 


Val 


Ala 


Val 


Val 


1 








5 










10 










15 




Val 


Glu 


Gin 


Ser 
20 


Glu 


Ala 


Arg 


Lys 


Gly 
25 


Arg 


Lys 


Tyr 


Leu 


Asn 
30 


Pro 


Gly 


Val 


Leu 


Asp 
35 


Arg 


Cys 


Arg 


Gly 


Pro 
40 


Asn 


Pro 


Pro 


Ala 


Gly 
45 


Cys 


His 


Pro 


His 


Asn 
50 


Ser 


His 


His 


Lys 


Pro 
55 


Arg 


Val 


Pro 


Val 


His 
60 


Asn 


Tyr 


Ser 


Arg 


Gly 


Cys 


Ser 


Arg 


He 


Thr 


Arg 


Cys 


Arg 


Arg 


Asp 


Ala 










65 










70 










75 












(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 24 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..72 







(D) OTHER 


INFORMATION: / Ceres 


Seq. ID 1009347 








(xi ) 


SEQUENCE DESCRIPTION: SEQ ID NO 


■ : A 4 : 










Met 


He 


He Leu 


Leu 


Val 


He Cys Val Ala 


Val Val 


Val 


Glu 


Gin 


Ser 


1 






5 




10 








15 




Glu 


Ala 


Arg Lys 


Gly 


Arg 


Lys Tyr Leu Asn 


Pro Gly 


Val 


Leu 


Asp 


Arg 






20 






25 






30 






Cys 


Arg 


Gly Pro 


Asn 


Pro 


Pro Ala Gly Cys 


His Pro 


His 


Asn 


Ser 


His 






35 






40 




45 








His 


Lys 


Pro Arg 


Val 


Pro 


Val His Asn Tyr 


Ser Arg 


Gly 


Cys 


Ser 


Arg 




50 








55 


60 










He 


Thr 


Arg Cys 


Arg 


Arg 


Asp Ala 












65 








70 














(2) 


INFORMATION 


FOR 


SEQ 


ID NO:25: 













(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
25 (B) LOCATION: 1..967 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010140 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
attttatttc ttctccaacc attctcaatc ctctctctcc aatggcttcc 
tttcctcttc tcaccttacg ctacgcactc cttctattgc cctacgcagc 
30 ctcctcgaac ctccgtttca ttctccgtca aggctcaatc cgtcgcgctt 
atttgaagaa gctcgcggcg gagaaagctg tggaggcaat taaacctggg 
ggctcggaac cggatccacc gcagctttcg ctgttgatca gatcgggaaa 
ccggtgaact ctacgatatt gtcggtatcc caacgtcgaa acgaacggag 
ggtcgttagg gattcctctt gttgggttag atacacatcc gagaatcgat 
35 acggagcaga cgaggtagat ccgaatcttg atttagtcaa aggtcgtgga 
tccgtgagaa aatggtggaa gctgtggctg acaagtttat tgttgtggct 
aactcgttac aggactcggt ggaagtggat tagctatgcc ggtggaagtt 
gctggaactt taatttgatt agattgcaag agctcttcaa ggaatttgga 
agcttagagt tgatggtgat ggcaagcctt atgtgactga taacagtaat 
4 0 atttgtattt taagactcct ttgaaggatg gattcgctgc ggctaaagag 
ttcaaggagt ggtggagcat ggtctgtttc tcggaatggc tacttctgtc 
gaaagaatgg cgttgaagtt atgaccaagt gaggttaaaa gttcaatctt 
tttgattgac aaatttttct cctttgtaaa attatcaatt gagtttttat 
ttcaatc 

45 (2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..28 9 
55 (D) OTHER INFORMATION: / Ceres Seq. ID 1010141 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
Phe He Ser Ser Pro Thr He Leu Asn Pro Leu Ser Pro Met Ala Ser 
15 10 15 

Leu Ser Phe Val Ser Ser Ser His Leu Thr Leu Arg Thr Pro Ser He 
60 20 25 30 

Ala Leu Arg Ser Thr Gly Ser Ser Pro Arg Thr Ser Val Ser Phe Ser 
35 40 45 



ttatccttcg 60 

actggctctt 120 

tcacaggacg 180 

atggttctag 240 

ctactctctt 300 

gaacaagcac 360 

ctcgctattg 420 

ggtgctcttc 480 

gatgatacca 540 

gttcaattct 600 

tgtgaatcaa 660 

tacattattg 720 

attgggaagt 780 

attatcgctg 840 

tctcttcttc 900 

gtgagatcat 960 
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Val Lys Ala 
50 

Ala Ala Glu 
65 

5 Leu Gly Thr 

Leu Leu Ser 

Lys Arg Thr 
10 ^ 115 

Leu Asp Thr 
130 

Val Asp Pro 
145 

15 Arg Glu Lys 

Asp Asp Thr 

Pro Val Glu 

20 195 

Gin Glu Leu 
210 

Gly Asp Gly 
225 

2 5 Leu Tyr Phe 

lie Gly Lys 

Ala Thr Ser 
30 275 
Lys 



Gin Ser Val Ala 
55 

Lys Ala Val Glu 
7 0 

Gly Ser Thr Ala 
85 

Ser Gly Glu Leu 
100 

Glu Glu Gin Ala 

His Pro Arg lie 
135 

Asn Leu Asp Leu 
150 

Met Val Glu Ala 
165 

Lys Leu Val Thr 
180 

Val Val Gin Phe 

Phe Lys Glu Phe 
215 

Lys Pro Tyr Val 
230 

Lys Thr Pro Leu 
245 

Phe Gin Gly Val 
260 

Val He He Ala 
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Leu Ser Gin Asp 

Ala He Lys Pro 
7 5 

Ala Phe Ala Val 
90 

Tyr Asp He Val 
105 

Arg Ser Leu Gly 
120 

Asp Leu Ala He 

Val Lys Gly Arg 
155 

Val Ala Asp Lys 
170 

Gly Leu Gly Gly 
185 

Cys Trp Asn Phe 
200 

Gly Cys Glu Ser 

Thr Asp Asn Ser 
235 

Lys Asp Gly Phe 
250 

Val Glu His Gly 
265 

Gly Lys Asn Gly 
280 



Asp Leu Lys Lys Leu 
60 

Gly Met Val Leu Gly 
80 

Asp Gin He Gly Lys 
95 

Gly He Pro Thr Ser 
110 

He Pro Leu Val Gly 
125 

Asp Gly Ala Asp Glu 
140 

Gly Gly Ala Leu Leu 
160 

Phe He Val Val Ala 
175 

Ser Gly Leu Ala Met 
190 

Asn Leu He Arg Leu 
205 

Lys Leu Arg Val Asp 
220 

Asn Tyr He He Asp 
240 

Ala Ala Ala Lys Glu 
255 

Leu Phe Leu Gly Met 
27 0 

Val Glu Val Met Thr 
285 



(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 27 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1. .27 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010142 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



45 


Met 
1 


Ala 


Ser 


Leu 


Ser 
5 


Phe 


Val 


Ser 


Ser 


Ser 

10 


His 


Leu 


Thr 


Leu 


Arg 
15 


Thr 




Pro 


Ser 


He 


Ala 
20 


Leu 


Arg 


Ser 


Thr 


Gly 
25 


Ser 


Ser 


Pro 


Arg 


Thr 

30 


Ser 


Val 




Ser 


Phe 


Ser 


Val 


Lys 


Ala 


Gin 


Ser 


Val 


Ala 


Leu 


Ser 


Gin 


Asp 


Asp 


Leu 


50 






35 










40 










45 










Lys 


Lys 
50 


Leu 


Ala 


Ala 


Glu 


Lys 
55 


Ala 


Val 


Glu 


Ala 


He 
60 


Lys 


Pro 


Gly 


Met 




Val 


Leu 


Gly 


Leu 


Gly 


Thr 


Gly 


Ser 


Thr 


Ala 


Ala 


Phe 


Ala 


Val 


Asp 


Gin 




65 










70 










75 










80 


55 


He 


Gly 


Lys 


Leu 


Leu 
85 


Ser 


Ser 


Gly 


Glu 


Leu 

90 


Tyr 


Asp 


He 


Val 


Gly 
95 


He 




Pro 


Thr 


Ser 


Lys 
100 


Arg 


Thr 


Glu 


Glu 


Gin 
105 


Ala 


Arg 


Ser 


Leu 


Gly 
110 


He 


Pro 




Leu 


Val 


Gly 


Leu 


Asp 


Thr 


His 


Pro 


Arg 


lie 


Asp 


Leu 


Ala 


He 


Asp 


Gly 


60 






115 










120 










125 










Ala 


Asp 


Glu 


Val 


Asp 


Pro 


Asn 


Leu 


Asp 


Leu 


Val 


Lys 


Gly 


Arg 


Gly 


Gly 



130 135 140 
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Ala Leu Leu Arg Glu Lys Met Val Glu Ala Val Ala Asp Lys Phe lie 
145 " 150 155 160 

Val Val Ala Asp Asp Thr Lys Leu Val Thr Gly Leu Gly Gly Ser Gly 
165 170 175 

5 Leu Ala Met Pro Val Glu Val Val Gin Phe Cys Trp Asn Phe Asn Leu 
180 185 190 

lie Arg Leu Gin Glu Leu Phe Lys Glu Phe Gly Cys Glu Ser Lys Leu 

195 200 205 

Arg Val Asp Gly Asp Gly Lys Pro Tyr Val Thr Asp Asn Ser Asn Tyr 
10 210 " 215 * 220 

lie lie Asp Leu Tyr Phe Lys Thr Pro Leu Lys Asp Gly Phe Ala Ala 
225 ^ 230 235 240 

Ala Lys Glu lie Gly Lys Phe Gin Gly Val Val Glu His Gly Leu Phe 
245 250 255 

15 Leu Gly Met Ala Thr Ser Val lie lie Ala Gly Lys Asn Gly Val Glu 

260 265 270 

Val Met Thr Lys 
275 

(2) INFORMATION FOR SEQ ID NO:28: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..213 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010143 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Val Leu Gly Leu Gly Thr Gly Ser Thr Ala Ala Phe Ala Val Asp 
1 5 ' 10 15 

Gin lie Gly Lys Leu Leu Ser Ser Gly Glu Leu Tyr Asp lie Val Gly 
20 25 30 

35 lie Pro Thr Ser Lys Arg Thr Glu Glu Gin Ala Arg Ser Leu Gly lie 
35 ^40 4 5 

Pro Leu Val Gly Leu Asp Thr His Pro Arg lie Asp Leu Ala lie Asp 

50 " ^55 60 

Gly Ala Asp Glu Val Asp Pro Asn Leu Asp Leu Val Lys Gly Arg Gly 
40 65 70 75 80 

Gly Ala Leu Leu Arg Glu Lys Met Val Glu Ala Val Ala Asp Lys Phe 

85 90 95 

lie Val Val Ala Asp Asp Thr Lys Leu Val Thr Gly Leu Gly Gly Ser 
100 " 105 110 

4 5 Gly Leu Ala Met Pro Val Glu Val Val Gin Phe Cys Trp Asn Phe Asn 
115 120 125 

Leu lie Arg Leu Gin Glu Leu Phe Lys Glu Phe Gly Cys Glu Ser Lys 

130 135 ' 140 

Leu Arg Val Asp Gly Asp Gly Lys Pro Tyr Val Thr Asp Asn Ser Asn 
50 145 " 150 " " 155 160 

Tyr lie lie Asp Leu Tyr Phe Lys Thr Pro Leu Lys Asp Gly Phe Ala 

165 170 175 

Ala Ala Lys Glu lie Gly Lys Phe Gin Gly Val Val Glu His Gly Leu 
180 ~ ' 185 190 

55 Phe Leu Gly Met Ala Thr Ser Val lie lie Ala Gly Lys Asn Gly Val 
195 200 205 

Glu Val Met Thr Lys 
210 

(2) INFORMATION FOR SEQ ID NO: 29: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..623 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010217 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



gaaaaccctt aagctaagca aatttcaaag caattcaaaa accctagcct ctctctcttt 6 

atctctcaca acaaaatctt tgaaatggcg atctctaagg cttccattgt tgttctcatg 12 

atggtgatta tctccgtcgt tgcatcggca cagtctgagg caccagcacc aagtcctact 18 

tctggatcta gtgcgatctc agcgtctttc gtgtcagctg tgcagttatc accggcggag 24 

agggaagcaa gggttttgag gtatagagag aagaggaaga atcggaaatt tgagaagacg 30 

attaggtatg cgtcgcgtaa agcttacgct gagatgaggc cgagaatcaa aggacgtttt 36 

gctaagagaa cagattcgag agagaatgat ggtggagacg tcggagttta ttgcggattc 42 

ggtgttgttc cgagtttctg atatttcccg gttaaagaaa catggtagta gtatatcggt 48 

taatagcaga ttaaggataa ttaaaactaa aatgtttcct gatttaatca ggggctaagt 54 

tattgttaga agattgttgt tttttgaatt aattctcaca attgtatcct aaattatata 60 



gtttacttaa tgttgaacta atg 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..118 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010218 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Met 


Ala 


He 


Ser 


Lys 


Ala 


Ser 


He 


Val 


Val 


Leu 


Met 


Met 


Val 


lie 


lie 


1 








5 










10 










15 




Ser 


Val 


Val 


Ala 


Ser 


Ala 


Gin 


Ser 


Glu 


Ala 


Pro 


Ala 


Pro 


Ser 


Pro 


Thr 








20 










25 










30 






Ser 


Gly 


Ser 


Ser 


Ala 


He 


Ser 


Ala 


Ser 


Phe 


Val 


Ser 


Ala 


Val 


Gin 


Leu 






35 










40 










45 








Ser 


Pro 


Ala 


Glu 


Arg 


Glu 


Ala 


Arg 


Val 


Leu 


Arg 


Tyr 


Arg 


Glu 


Lys 


Arg 




50 










55 










60 










Lys 


Asn 


Arg 


Lys 


Phe 


Glu 


Lys 


Thr 


He 


Arg 


Tyr 


Ala 


Ser 


Arg 


Lys 


Ala 


65 










70 










75 










80 


Tyr 


Ala 


Glu 


Met 


Arg 


Pro 


Arg 


He 


Lys 


Gly Arg 


Phe 


Ala 


Lys 


Arg 


Thr 










85 










90 










95 




Asp 


Ser 


Arg 


Glu 


Asn 


Asp 


Gly 


Gly 


Asp 


Val 


Gly 


Val 


Tyr 


Cys 


Gly 


Phe 








100 










105 










110 






Gly 


Val 


Val 


Pro 


Ser 


Phe 























115 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 107 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010219 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
Met Met Val He He Ser Val Val Ala Ser Ala Gin Ser Glu Ala Pro 
15 10 15 

Ala Pro Ser Pro Thr Ser Gly Ser Ser Ala He Ser Ala Ser Phe Val 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..155 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010303 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
Met Gly Leu Ser Gly Val Leu His Val Glu Val Glu Val Lys Ser Pro 
1 5 10 15 

Ala Glu Lys Phe Trp Val Ala Leu Gly Asp Gly lie Asn Leu Phe Pro 

20 25 " 30 

Lys Ala Phe Pro Asn Asp Tyr Lys Thr lie Gin Val Leu Ala Gly Asp 

35 "* 40 45 

Gly Asn Ala Pro Gly Ser lie Arg Leu lie Thr Tyr Gly Glu Gly Ser 

50 55 60 

Pro Leu Val Lys lie Ser Ala Glu Arg lie Glu Ala Val Asp Leu Glu 
65 70 75 80 

Asn Lys Ser Met Ser Tyr Ser lie lie Gly Gly Glu Met Leu Glu Tyr 

85 90 ' 95 

Tyr Lys Thr Phe Lys Gly Thr lie Thr Val lie Pro Lys Asn Gly Gly 

100 105 110 

Ser Leu Leu Lys Trp Ser Gly Glu Phe Glu Lys Thr Ala His Glu lie 

115 ' ' 120 ' 125 

Asp Asp Pro His Val lie Lys Asp Phe Ala Val Lys Asn Phe Lys Glu 

130 135 140 

lie Asp Glu Tyr Leu Leu Lys Gin Thr Ser Ala 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..531 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010815 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
gcgccgcccg aacccgaaga agaactctct cttgtaaacc tgcgatctga atcttcccgg 
cggtgaaaaa tggttgcgga gaagaaggcg aagaagtccc atgaaggaat caacagcaga 
ttagctcttg tgatgaagag tggaaagtac actcttggat acaagtctgt tctcaaatcc 
cttcgcagct ccaaaggtaa gctgatattg atatctagca attgcccacc gttgagaagg 
tcagagattg aatactatgc gatgcttgct aaagttggag ttcatcgcta caatggcaac 
aatgttgatt tgggtactgc ttgtggtaaa tacttccgag tttcttgcct cagcatcgtt 
gatcctggtg attccgacat catcaagaca cttcctggag atcagtgatt ctgattttga 
tgattttgcc atgttgttct ctatgcttga ttatgctttg tgttcttcaa agacttaatg 
ttagatttct ttggtttccc gacttcctct taatggttat aattaattac t 
(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



60 
120 
180 
240 
300 
360 
420 
480 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE 
FEATURE: 

(A) NAME /KEY 

(B) LOCATION 



peptide 
1. .112 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



ID 1010816 
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Met 


Val 


Ala 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Ser 


His 


Glu 


Glv 


He 


Asn 


Ser 


1 








5 










10 










15 




Arg 


Leu 


Ala 


Leu 


Val 


Met 


Lys 


Ser 


Glv 

vj -i- jr 


Lys 


Tvr 
y 


Thr 


Leu 


Gly 


Tvr 
y 


Lys 








20 










25 










30 






Ser 


Val 


Leu 


Ly s 


Ser 


Leu 


Arg 


Ser 


Ser 


Lys 


Glv 


Lys 


Leu 


He 


Leu 


He 






35 










40 










45 








Ser 


Ser 


Asn 


Cys 


Pro 


Pro 


Leu 


Arg 


Arg 


Ser 


Glu 


He 


Glu 


Tyr 


Tvr 
y 


Ala 




50 










55 










60 










Met 


Leu 


Ala 


Lys 


Val 


Gly 


Val 


His 


Arg 


Tyr 


Asn 


Gly Asn Asn 


Val 


Asp 


65 










70 










75 










80 


Leu 


Gly 


Thr 


Ala 


Cys 


Gly 


Lys 


Tyr 


Phe 


Arg 


Val 


Ser 


Cys 


Leu 


Ser 


He 










85 










90 










95 




Val 


Asp 


Pro 


Gly 


Asp 


Ser 


Asp 


He 


He 


Lys 


Thr 


Leu 


Pro 


Gly 


Asp 


Gin 








100 










105 










110 






(2) 


INFORMATION 


FOR 


SEQ 


ID NO:37 



















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..91 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010817 





<xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO: 37 : 












Met 


Lys 


Ser 


Gly 


Lys 


Tyr 


Thr Leu Gly 


Tyr 


Lys 


Ser 


Val 


Leu 


Lys 


Ser 


1 








5 






10 










15 




Leu 


Arg 


Ser 


Ser 


Lys 


Gly 


Lys Leu lie 


Leu 


He 


Ser 


Ser 


Asn 


Cys 


Pro 








20 






25 










30 






Pro 


Leu 


Arg 


Arg 


Ser 


Glu 


He Glu Tyr 


Tyr 


Ala 


Met 


Leu 


Ala 


Lys 


Val 






35 








40 








45 








Gly 


Val 


His 


Arg 


Tyr 


Asn 


Gly Asn Asn 


Val 


Asp 


Leu 


Gly 


Thr 


Ala 


Cys 




50 










55 






60 










Gly 


Lys 


Tyr 


Phe 


Arg 


Val 


Ser Cys Leu 


Ser 


He 


Val 


Asp 


Pro 


Gly 


Asp 


65 










70 






75 










80 


Ser 


Asp 


He 


He 


Lys 


Thr 


Leu Pro Gly 


Asp 


Gin 




















85 






90 














(2) 


INFORMATION 


FOR 


SEQ 


ID NO:38 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..7 3 

(D) OTHER INFORMATION: / Ceres Seq. ID 1010818 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
Met Arg Cys Leu Leu Lys Leu Glu Phe He Ala Thr Met Ala Thr Met 
1 5 10 15 

Leu He Trp Val Leu Leu Val Val Asn Thr Ser Glu Phe Leu Ala Ser 

20 25 30 

Ala Ser Leu He Leu Val He Pro Thr Ser Ser Arg His Phe Leu Glu 

35 40 45 

lie Ser Asp Ser Asp Phe Asp Asp Phe Ala Met Leu Phe Ser Met Leu 

50 " ~ 55 ^ 60 

Asp Tyr Ala Leu Cys Ser Ser Lys Thr 
65 70 
(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. .494 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011437 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



acaatcggaa gaaaggagaa gacgaagatg tcgttggtat ggctggaagc gatgttgcct 60 

ctcggaatca tcggtgggat gctctgtatc atgggcaatt ctcagtacta catccacaaa 120 

gcttatcatg gccgtcctaa gcacatcggc cacgatgaat gggatgtttc tatggaaaga 180 

cgcgacaaga aagtcgtcga gaaagctgca gctccttcct catgattcgc tttatctctt 240 

ttgtgttcct caggggctta aggtgaccac ttgtggtgac aaataaagtg cattccagaa 300 

gaagaagaag ctgggggatc tagtactttc attcccattt gattttcctt ggacatatta 360 

aagctttcag aaatcagacc tcaataacat ttggtttatc aatatttctc tattcgtgat 420 

attttgtatg cttttaagtt gcatgataca ctgaactact ctcttgctgt gtgagtgaat 480 



aaatgaatat ctgt 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..94 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011438 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Gin 


Ser 


Glu 


Glu 


Arg 


Arg 


Arg 


Arg 


Arg 


Cys 


Arg 


Trp 


Tyr 


Gly 


Trp 


Lys 


1 








5 










10 










15 




Arg 


Cys 


Cys 


Leu 


Ser 


Glu 


Ser 


Ser 


Val 


Gly 


Cys 


Ser 


Val 


Ser 


Trp 


Ala 








20 










25 










30 






He 


Leu 


Ser 


Thr 


Thr 


Ser 


Thr 


Lys 


Leu 


He 


Met 


Ala 


Val 


Leu 


Ser 


Thr 






35 










40 










45 








Ser 


Ala 


Thr 


Met 


Asn 


Gly 


Met 


Phe 


Leu 


Trp 


Lys 


Asp 


Ala 


Thr 


Arg 


Lys 




50 










55 










60 










Ser 


Ser 


Arg 


Lys 


Leu 


Gin 


Leu 


Leu 


Pro 


His 


Asp 


Ser 


Leu 


Tyr 


Leu 


Phe 


65 










70 










75 










80 


Cys 


Val 


Pro 


Gin 


Gly 


Leu 


Lys 


Val 


Thr 


Thr 


Cys 


Gly 


Asp 


Lys 







85 90 
(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1. .74 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011439 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
Thr He Gly Arg Lys Glu Lys Thr Lys Met Ser Leu Val Trp Leu Glu 
1 5 10 15 

Ala Met Leu Pro Leu Gly He He Gly Gly Met Leu Cys He Met Gly 

20 25 30 

Asn Ser Gin Tyr Tyr He His Lys Ala Tyr His Gly Arg Pro Lys His 

35 4 0 4 5 

He Gly His Asp Glu Trp Asp Val Ser Met Glu Arg Arg Asp Lys Lys 
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50 55 60 

Val Val Glu Lys Ala Ala Ala Pro Ser Ser 
65 70 
(2) INFORMATION FOR SEQ ID NO: 42: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
10 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..65 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011440 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ser Leu Val Trp Leu Glu Ala Met Leu Pro Leu Gly lie lie Gly 
15 10 15 

Gly Met Leu Cys lie Met Gly Asn Ser Gin Tyr Tyr lie His Lys Ala 
20 25 ~ 30 

2 0 Tyr His Gly Arg Pro Lys His lie Gly His Asp Glu Trp Asp Val Ser 
35 4 0 4 5 

Met Glu Arg Arg Asp Lys Lys Val Val Glu Lys Ala Ala Ala Pro Ser 
50 55 ^60 

Ser 
25 65 

(2) INFORMATION FOR SEQ ID NO : 4 3 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
35 (B) LOCATION: 1..1011 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011616 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 3 : 
ctgtttcttt ttgcttcctt agacttctcc gtcgattaac ccttgccgag ggtcattttt 60 
ttttcttgaa ttttttcacc aagtgttcgt tcacggcatt gcttgtcctc cagtatgagt 120 
4 0 atctccggtg ctgcagttgg ttcaggcaga aatttaagaa gagcggtaga gtttgggaaa 180 
actcatgtgg ttaggcctaa agggaaacat caagcaacta ttgtctggtt acatgggctt 240 
ggggacaatg gctcgagctg gtcccagctt ttggagaccc ttccccttcc aaatatcaaa 300 
tggatttgcc cgactgctcc ttctcaacca ataagtttat ttggtggttt tccctccaca 360 
gcttggtttg atgttgtgga catcaatgaa gatggacctg atgatatgga aggattggat 420 
4 5 gtggctgctg cacatgttgc aaatctgttg tcgaatgagc ctgctgacat taaattaggt 480 
gttggaggat tcagcatggg tgcggcgaca tctctatatt ctgcaacttg ttttgctctc 540 
ggtaaatatg gaaatggcaa tccataccct atcaatttaa gcgcaatcat aggcttaagc 600 
ggctggcttc cttgtgcaaa gacattggct ggcaaactag aagaggaaca gatcaagaac 660 
cgagctgcat cgttacccat tgttgtctgt catggaaaag ctgatgatgt ggtaccgttc 720 
50 aagtttgggg agaaatcttc acaggctttg ctttcaaatg ggtttaagaa ggtgaccttc 780 
aaaccttaca gtgcacttgg tcaccacaca atcccacagg agttggatga gttgtgcgca 840 
tggttgacat ccacgctcag cctcgaaggt tgatacttcc tatgatgtag ctttctgatg 900 
aaaacccttc aactcttgag agtttgattg aattggatgt ttcaggattt cacaatgttt 960 
tcattggaat atttgtgtaa gacacattcg tctagtagca aatcttctcc c 
55 (2) INFORMATION FOR SEQ ID NO : 4 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

60 ' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
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(A) NAME /KEY : peptide 

(B) LOCATION: 1..252 

(D) OTHER INFORMATION: ./ Ceres Seq. ID 1011617 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 : 



5 


Met 
1 


Ser 


He 


Ser 


Gly 
5 


Ala 


Ala 


Val 


Gly 


Ser 
10 


Gly 


Arg 


Asn 


Leu 


Arg 
15 


Arg 




Ala 


Val 


Glu 


Phe 
20 


Gly 


Lys 


Thr 


His 


Val 
25 


Val 


Arg 


Pro 


Lys 


Gly 
30 


Lys 


His 




Gin 


Ala 


Thr 


He 


Val 


Trp 


Leu 


His 


Gly 


Leu 


Gly 


Asp 


Asn 


Gly 


Ser 


Ser 


10 






35 










40 










45 










Trp 


Ser 
50 


Gin 


Leu 


Leu 


Glu 


Thr 
55 


Leu 


Pro 


Leu 


Pro 


Asn 
60 


He 


Lys 


Trp 


He 




Cys 


Pro 


Thr 


Ala 


Pro 


Ser 


Gin 


Pro 


He 


Ser 


Leu 


Phe 


Gly 


Gly 


Phe 


Pro 




65 










70 










75 










80 


15 


Ser 


Thr 


Ala 


Trp 


Phe 
85 


Asp 


Val 


Val 


Asp 


He 
90 


Asn 


Glu 


Asp 


Gly 


Pro 
95 


Asp 




Asp 


Met 


Glu 


Gly 
100 


Leu 


Asp 


Val 


Ala 


Ala 
105 


Ala 


His 


Val 


Ala 


Asn 
110 


Leu 


Leu 




Ser 


Asn 


Glu 


Pro 


Ala 


Asp 


He 


Lys 


Leu 


Gly 


Val 


Gly 


Gly 


Phe 


Ser 


Met 


20 






115 










120 










125 










Gly 


Ala 
130 


Ala 


Thr 


Ser 


Leu 


Tyr 
135 


Ser 


Ala 


Thr 


Cys 


Phe 
140 


Ala 


Leu 


Gly 


Lys 




Tyr 


Gly 


Asn 


Gly 


Asn 


Pro 


Tyr 


Pro 


He 


Asn 


Leu 


Ser 


Ala 


He 


He 


Gly 




145 










150 










155 










160 


25 


Leu 


Ser 


Gly 


Trp 


Leu 
165 


Pro 


Cys 


Ala 


Lys 


Thr 
170 


Leu 


Ala 


Gly 


Lys 


Leu 
175 


Glu 




Glu 


Glu 


Gin 


He 
180 


Lys 


Asn 


Arg 


Ala 


Ala 
185 


Ser 


Leu 


Pro 


He 


Val 
190 


Val 


Cys 




His 


Gly 


Lys 


Ala 


Asp 


Asp 


Val 


Val 


Pro 


Phe 


Lys 


Phe 


Gly 


Glu 


Lys 


Ser 


30 






195 










200 










205 










Ser 


Gin 
210 


Ala 


Leu 


Leu 


Ser 


Asn 
215 


Gly 


Phe 


Lys 


Lys 


Val 
220 


Thr 


Phe 


Lys 


Pro 




Tyr 


Ser 


Ala 


Leu 


Gly 


His 


His 


Thr 


He 


Pro 


Gin 


Glu 


Leu 


Asp 


Glu 


Leu 




225 










230 










235 










240 


35 


Cys 


Ala 


Trp 


Leu 


Thr 


Ser 


Thr 


Leu 


Ser 


Leu 


Glu 


Gly 











245 250 



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 155 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011618 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



Met 


Glu 


Gly 


Leu 


Asp 


Val 


Ala 


Ala 


Ala 


His 


Val 


Ala 


Asn 


Leu 


Leu 


Ser 


1 








5 










10 










15 




Asn 


Glu 


Pro 


Ala 


Asp 


He 


Lys 


Leu 


Gly 


Val 


Gly 


Gly 


Phe 


Ser 


Met 


Gly 








20 










25 










30 






Ala 


Ala 


Thr 


Ser 


Leu 


Tyr 


Ser 


Ala 


Thr 


Cys 


Phe 


Ala 


Leu 


Gly 


Lys 


Tyr 






35 










40 










45 








Gly 


Asn 


Gly 


Asn 


Pro 


Tyr 


Pro 


He 


Asn 


Leu 


Ser 


Ala 


He 


He 


Gly 


Leu 




50 










55 










60 










Ser 


Gly 


Trp 


Leu 


Pro 


Cys 


Ala 


Lys 


Thr 


Leu 


Ala 


Gly 


Lys 


Leu 


Glu 


Glu 


65 










70 










75 










80 


Glu 


Gin 


He 


Lys 


Asn 


Arg 


Ala 


Ala 


Ser 


Leu 


Pro 


He 


Val 


Val 


Cys 


His 










85 










90 










95 




Gly 


Lys 


Ala 


Asp 


Asp 


Val 


Val 


Pro 


Phe 


Lys 


Phe 


Gly 


Glu 


Lys 


Ser 


Ser 








100 










105 










110 
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Gin Ala Leu Leu Ser Asn Gly Phe Lys Lys Val Thr Phe Lys Pro Tyr 

115 120 125 

Ser Ala Leu Gly His His Thr lie Pro Gin Glu Leu Asp Glu Leu Cys 
130 ~ 135 140 

5 Ala Trp Leu Thr Ser Thr Leu Ser Leu Glu Gly 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
15 (A) NAME /KEY : - 

(B) LOCATION: 1..643 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011631 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 : 
agaatgtgat ggagatggga gtgaatgaga tgagtggtgg gattaaagaa gagaagaagg 60 

20 agaaggctct tatgcttaga ttggactatg aatcagtcat ttccacttgg ggaggccaag 120 
ggatcccatg gaccgcccgg gtgccatctg aaatagacct cgacatggtt tgtttcccaa 180 
cccataccat gggtgaaagt ggagcagagg ctcatcatca caaccacttc cgcggcctag 24 0 

ggttacacct aggagatgct ggggatggag gaagagaggc tagggtttca agataccgag 300 
agaaaaggag gacaaggttg ttctccaaga agataaggta cgaggtacgt aaattgaatg 360 

25 cagagaaaag gcctcgcatg aaaggaaggt tcgtcaagag atcttcaatt ggtgttgctc 420 
actaaagaac ttaattaatt atggatatta aattactttg ctctcatctt gcttttttgt 480 
tgctatagtt ttggtgattg ttagctttct ttttctgcat tcatagagaa ttttgcacgt 540 
ttttgtgagc tacgtatgta cataaatata tcaccaaaaa aatgtgacta tcttgtaagc 600 
actgatttat atagtcgata taacgtgaat tttgattgct ggc 

30 (2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 14 0 

40 (D) OTHER INFORMATION: / Ceres Seq. ID 1011632 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
Asn Val Met Glu Met Gly Val Asn Glu Met Ser Gly Gly lie Lys Glu 
1 5 10 J 15 

Glu Lys Lys Glu Lys Ala Leu Met Leu Arg Leu Asp Tyr Glu Ser Val 
45 20 25 30 

lie Ser Thr Trp Gly Gly Gin Gly lie Pro Trp Thr Ala Arg Val Pro 

35 4 0 45 

Ser Glu lie Asp Leu Asp Met Val Cys Phe Pro Thr His Thr Met Gly 
5 0 55 6 0 

50 Glu Ser Gly Ala Glu Ala His His His Asn His Phe Arg Gly Leu Gly 
65 ^70 75 80 

Leu His Leu Gly Asp Ala Gly Asp Gly Gly Arg Glu Ala Arg Val Ser 

85 " 90 95 

Arg Tyr Arg Glu Lys Arg Arg Thr Arg Leu Phe Ser Lys Lys lie Arg 

55 ioo 105 no 

Tyr Glu Val Arg Lys Leu Asn Ala Glu Lys Arg Pro Arg Met Lys Gly 

115 J 120 ~ 125 

Arg Phe Val Lys Arg Ser Ser lie Gly Val Ala His 
130 135 ~ 140 

60 (2) INFORMATION FOR SEQ ID NO : 4 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 







(ii) 


MOLECULE TYPE: 


peptide 


















5 




(ix) 


FEATURE: 




























(A) NAME /KEY : 


peptide 
























(B) LOCATION: 


1. . 


138 
























(D) OTHER INFORMATION 


: / ( 


:eres 


; Seq. ID 1011633 








(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO: 48 : 












10 


Met 


Glu 


Met Gly Val Asn 


Glu 


Met 


Ser 


Gly 


Gly 


He 


Lvs 


Glu 


Glu 


Lys 




1 




5 








10 










15 






Lys 


Glu 


Lys Ala Leu Met 


Leu 


Arg 


Leu 


Asp 


Tvr 
y 


Glu 


Ser 


Val 


He 


Ser 








20 






25 










30 








Thr 


Trp 


Gly Gly Gin Gly 


He 


Pro 


Trp 


Thr 


Ala 


Arg 


Val 


Pro 


Ser 


Glu 


15 






35 




40 










45 










He 


Asp 


Leu Asp Met Val 


Cys 


Phe 


Pro 


Thr 


His 


Thr 


Met 


Glv 


Glu 


Ser 






50 




55 










60 












Gly 


Ala 


Glu Ala His His. 


His 


Asn 


His 


Phe 


Arg 


Glv 


Leu 


Glv 


Leu 


His 




65 




70 










75 










80 


20 


Leu 


Gly 


Asp Ala Gly Asp 


Gly 


Gly 


Arg 


Glu 


Ala 


Arg 


Val 


Ser 


Arg 


Tyr 








85 








90 










95 






Arg 


Glu 


Lys Arg Arg Thr 


Arg 


Leu 


Phe 


Ser 


Lys 


Lys 


He 


Arg 


Tvr 


Glu 








100 






105 










110 








Val 


Arg 


Lys Leu Asn Ala 


Glu 


Lys 


Arg 


Pro 


Arg 


Met 


Lys 


Gly 


Arg 


Phe 


25 






115 




120 










125 










Val 


Lys 


Arg Ser Ser He 


Gly 


Val 


Ala 


His 


















130 




135 






















(2) 


INFORMATION FOR SEQ 


ID NO:49: 




















(i) 


SEQUENCE CHARACTERISTICS: 
















30 






(A) LENGTH: 136 amino 


acids 




















(B) TYPE: amino acid 
























(C) STRANDEDNESS: 


























(D) TOPOLOGY: 


linear 






















(ii) 


MOLECULE TYPE: 


peptide 


















35 




(ix) 


FEATURE: 




























(A) NAME /KEY : 


peptide 
























(B) LOCATION: 


1. . 


136 
























(D) OTHER INFORMATION 


: / ' 


Ceres Seq. ID 1011634 








(xi) 


SEQUENCE DESCRIPTION: ! 


SEQ 


ID NO:49: 












40 


Met 


Gly 


Val Asn Glu Met 


Ser 


Gly 


Gly 


He 


Lys 


Glu 


Glu 


Lys 


Lys 


Glu 




1 




5 








10 










15 






Lys 


Ala 


Leu Met Leu Arg 


Leu 


Asp 


Tyr 


Glu 


Ser 


Val 


He 


Ser 


Thr 


Trp 








20 






25 










30 








Gly 


Gly 


Gin Gly He Pro 


Trp 


Thr 


Ala 


Arg 


Val 


Pro 


Ser 


Glu 


He 


Asp 


45 






35 




40 










45 










Leu 


Asp 


Met Val Cys Phe 


Pro 


Thr 


His 


Thr 


Met 


Gly 


Glu 


Ser 


Gly 


Ala 






50 




55 










60 












Glu 


Ala 


His His His Asn 


His 


Phe 


Arg 


Gly 


Leu 


Gly 


Leu 


His 


Leu 


Gly 




65 




70 










75 










80 


50 


Asp 


Ala 


Gly Asp Gly Gly 


Arg 


Glu 


Ala 


Arg 


Val 


Ser 


Arg 


Tyr 


Arg 


Glu 








85 








90 










95 






Lys 


Arg 


Arg Thr Arg Leu 


Phe 


Ser 


Lys 


Lys 


He 


Arg 


Tyr 


Glu 


Val 


Arg 








100 






105 










110 








Lys 


Leu 


Asn Ala Glu Lys 


Arg 


Pro 


Arg 


Met 


Lys 


Gly 


Arg 


Phe 


Val 


Lys 


55 






115 




120 










125 










Arg 


Ser 


Ser He Gly Val 


Ala 


His 






















130 




135 






















(2) 


INFORMATION FOR SEQ 


ID : 


NO:50: 

















(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(D) TOPOLOGY: 
(ii) MOLECULE TYPE: 
(ix) FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



349 

linear 

DNA (genomic) 



(xi) 



1..663 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011714 
SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
gtgttcgata ataatgtcga caggagaagc gataccaaga gtcgccgtcg tcgttttcat 
tctcaacgga aactcaatct tattaggtcg ccgccgttcc tcaatcggca actccacttt 
cgctcttccc ggtggccacc tcgaattcgg agagagcttt gaagaatgtg cagcgagaga 
agtaatggag gaaacaggtc taaagattga aaagatgaag cttttgactg ttacaaacaa 
tgtcttcaaa gaagcaccaa cgccatcaca ctacgtctct gtttcgatac gtgcggtgtt 
ggtggatcca agtcaagaac cgaagaatat ggaaccagag aagtgtgaag gatgggattg 
gtatgattgg gagaatctac caaagccttt gttttggcca cttgagaaat tgtttggaag 
tggtttcaat cctttcactc atggtggtgg agactaatag atgtaagagt taatgattga 
tttgggattg aatgttgcaa attgggcatt tggtctagtg gtatgattct cgcttagggt 
gcgagaggtc ccgagttcaa ttctcggaat gcccctctct tttactcttt tttattgtat 
ttttggtttc tgttttccta attagtaata aacttatcat gtgtcctatc gattttattt 
etc 

INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
..(D) TOPOLOGY: linear 

peptide 



(2) 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



ID 1011715 



peptide 
1. . 151 

(D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
Cys Ser He He Met Ser Thr Gly Glu Ala He Pro Arg Val Ala Val 
15 10 15 

Val Val Phe He Leu Asn Gly Asn Ser He Leu Leu Gly Arg Arg Arg 

20 25 30 

Ser Ser He Gly Asn Ser Thr Phe Ala Leu Pro Gly Gly His Leu Glu 

35 40 45 

Phe Gly Glu Ser Phe Glu Glu Cys Ala Ala Arg Glu Val Met Glu Glu 

50 55 60 

Thr Gly Leu Lys He Glu Lys Met Lys Leu Leu Thr Val Thr Asn Asn 
65 70 75 80 

Val Phe Lys Glu Ala Pro Thr Pro Ser His Tyr Val Ser Val Ser He 

85 90 95 

Arg Ala Val Leu Val Asp Pro Ser Gin Glu Pro Lys Asn Met Glu Pro 

100 105 " 110 

Glu Lys Cys Glu Gly Trp Asp Trp Tyr Asp Trp Glu Asn Leu Pro Lys 

115 120 ~ 125 

Pro Leu Phe Trp Pro Leu Glu Lys Leu Phe Gly Ser Gly Phe Asn Pro 

130 135 " " 140 

Phe Thr His Gly Gly Gly Asp 
145 150 
(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..147 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011716 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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aacagttgca agattccttc cccaagctgt taaggaaatg ttaatcatga ttaattagtg 360 
acaagtttcg ctgattatag tggttaatgc tggtcttatc ttcgtcagta ctacttaaaa 420 
taataaatga gtgatgatga ttgatgatta tcatcatcat gcttgaatgt tatgttttct 480 
atcttataaa atatgaaata tactttagaa atatgtttcg tcgt 
5 (2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..110 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 1011785 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 



Lys 


Thr 


Glu 


Gin 


Thr 


Gin 


Thr 


Gin 


Thr 


Glu 


Asn 


Gin 


Arg 


Arg 


Arg 


Lys 


1 








5 










10 










15 




Met 


Val 


Lys 


Val 
20 


Met 


Trp 


Val 


Ser 


Val 

25 


Leu 


Ala 


Leu 


Ala 


Ala 
30 


Ala 


He 


Leu 


Leu 


Leu 
35 


Thr 


Val 


Pro 


Val 


Ala 
40 


Glu 


Gly 


Val 


Thr 


Cys 
45 


Ser 


Pro 


Met 


Gin 


Leu 
50 


Ala 


Ser 


Cys 


Ala 


Ala 

55 


Ala 


Met 


Thr 


Ser 


Ser 
60 


Ser 


Pro 


Pro 


Ser 


Glu 


Ala 


Cys 


Cys 


Thr 


Lys 


Leu 


Arg 


Glu 


Gin 


Gin 


Pro 


Cys 


Leu 


Cys 


Gly 


65 










70 










75 










80 


Tyr 


Met 


Arg 


Asn 


Pro 
85 


Thr 


Leu 


Arg 


Gin 


Tyr 
90 


Val 


Ser 


Ser 


Pro 


Asn 
95 


Ala 


Arg 


Lys 


Val 


Ser 
100 


Asn 


Ser 


Cys 


Lys 


He 
105 


Pro 


Ser 


Pro 


Ser 


Cys 
110 







(2) INFORMATION FOR SEQ ID NO : 5 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acias 

(B) TYPE: amino acid 
35 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
4 0 (B) LOCATION: 1..94 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011786 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 6 : 



Met 


Val 


Lys 


Val 


Met 


Trp 


Val 


Ser 


Val 


Leu 


Ala 


Leu 


Ala 


Ala 


Ala 


He 


1 








5 










10 










15 




Leu 


Leu 


Leu 


Thr 
20 


Val 


Pro 


Val 


Ala 


Glu 
25 


Gly 


Val 


Thr 


Cys 


Ser 
30 


Pro 


Met 


Gin 


Leu 


Ala 
35 


Ser 


Cys 


Ala 


Ala 


Ala 
40 


Met 


Thr 


Ser 


Ser 


Ser 
45 


Pro 


Pro 


Ser 


Glu 


Ala 

50 


Cys 


Cys 


Thr 


Lys 


Leu 
55 


Arg 


Glu 


Gin 


Gin 


Pro 
60 


Cys 


Leu 


Cys 


Gly 


Tyr 


Met 


Arg 


Asn 


Pro 


Thr 


Leu 


Arg 


Gin 


Tyr 


Val 


Ser 


Ser 


Pro 


Asn 


Ala 


65 










70 










75 










80 


Arg 


Lys 


Val 


Ser 


Asn 
85 


Ser 


Cys 


Lys 


He 


Pro 
90 


Ser 


Pro 


Ser 


Cys 







55 (2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

6 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
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(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011787 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



Asn 


Arg 


Thr 


Asn 


Thr 


Asn 


Thr Asn Arg 


Glu 


Ser 


Thr 


Lys 


Lvs 


Lys 


Asn 


1 








5 






10 










15 




Gly 


Glu 


Gly 


Asp 


Val 


Gly 


Phe Arg Phe 


Ser 


Ser 


Gly 


Gly 


Gly 


Asp 


Ser 








20 






25 










30 






Pro 


Phe 


Asp 


Gly 


Pro 


Gly 


Gly Arg Arg 


Gly 


Asp 


Val 


Leu 


Ala 


Tyr 


Ala 






35 








40 








45 








Ala 


Gly 


Phe 


Met 


Cys 


Gly 


Gly Asp Asp 


Val 


He 


Phe 


Ala 


Ala 


He 


Gly 




50 










55 






60 










Gly 


Val 


Leu 


His 


Lys 


Ala 


Glu Arg Ala 


Ala 


Ala 


Met 


Pro 


Leu 


Trp 


Val 


65 










70 






75 










80 


His 


Glu 


Glu 


Pro 


Tyr 


Pro 


Pro Pro lie 


Arg 






















85 






90 














(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 58: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

2 5 (A) NAME /KEY : - 

(B) LOCATION: 1..1007 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011820 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

ccaaatttct ccatggatcg ttctttgaat ctcctcgatt 
30 cagctagcta ttccatcgcc actaaatgga aaagtaatac 

acgagtggtt cctttgtact tcaccagcta atgaaacgtg 

gacgcactta tctttctcgc ttttgctcgc cctttctctc 

aaactgggat gtaatttagc tacccataag tcgaataatc 

ctcatggtta agtgttcaga tggggatcaa atggaagaca 
35 ctatttcggg agatacaaga aaccgttcga aagctacaga 

actgttatgg tggatgacat gtctctgctg gaaattgcta 

cacgtattgg acttcttgca ttattgccac acattaagtt 

gtcatcctca atcatgaaga tatatacgcg agcatggaga 

atggtatgcc ttgcagatgt tgtgataaag gcagagcctt 
4 0 gatgtacatg gccaattgac tgttctgaac aaagggataa 

tcgaggaaca agttgcagaa ttttcaattc aggatcaagg 

tatcctggtt gcagaagctg aggattagac accgttctga 

tgcatctaga cagttttgtt atttgcgttt aggttaaaca 

tttactttag tgtctcttct ggttgtaact tgatgactta 
45 acacagaaac ttgtagacat ggaaaattag taacattcta 
(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 
50 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
55 (B) LOCATION: 1..2 66 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011821 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
Pro Asn Phe Ser Met Asp Arg Ser Leu Asn Leu Leu Asp Leu Ala Leu 
15 10 15 

60 Gly Phe Asp Glu Gin Leu Ala lie Pro Ser Pro Leu Asn Gly Lys Val 

20 25 30 

lie Leu lie Glu Asp Cys Val Glu Thr Ser Gly Ser Phe Val Leu His 



tagccttagg gttcgatgag 60 

taatagaaga ctgtgtagag 120 

ttctctcctc taactcctcc 180 

attatgatcg aatcttgcgt 240 

gattggtgtt ctttgacatg 300 

atgtgagtgc agttgcgaaa 360 

gtgtaacaag tggtaacata 420 

ctaccggcag caactcagat 480 

ctgaaagcaa ttgttcattg 540 

gacctgcatt tttgctacag 600 

tagcctctgg tttagcaaat 660 

gcaactcagg tagaggaagc 720 

aaaatggtat cgactatttc 780 

gcattagcat ggtagtgaga 840 

gagacactct gttcaacatt 900 

gctaaagttg aatgcttgag 960 
tatcttt 
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35 40 45 

Gin Leu Met Lys Arg Val Leu Ser Ser Asn Ser Ser Asp Ala Leu lie 

5 0 ~ 55 60 

Phe Leu Ala Phe Ala Arg Pro Phe Ser His Tyr Asp Arg lie Leu Arg 
5 65 70 75 80 

Lys Leu Gly Cys Asn Leu Ala Thr His Lys Ser Asn Asn Arg Leu Val 

8 5 90 95 

Phe Phe Asp Met Leu Met Val Lys Cys Ser Asp Gly Asp Gin Met Glu 
100 105 110 

10 Asp Asn Val Ser Ala Val Ala Lys Leu Phe Arg Glu lie Gin Glu Thr 
115 120 125 

Val Arg Lys Leu Gin Ser Val Thr Ser Gly Asn lie Thr Val Met Val 

130 135 140 

Asp Asp Met Ser Leu Leu Glu lie Ala Thr Thr Gly Ser Asn Ser Asp 
15 145 150 155 160 

His Val Leu Asp Phe Leu His Tyr Cys His Thr Leu Ser Ser Glu Ser 

165 170 175 

Asn Cys Ser Leu Val lie Leu Asn His Glu Asp lie Tyr Ala Ser Met 
180 185 ~ 190 

2 0 Glu Arg Pro Ala Phe Leu Leu Gin Met Val Cys Leu Ala Asp Val Val 

195 200 205 

lie Lys Ala Glu Pro Leu Ala Ser Gly Leu Ala Asn Asp Val His Gly 

210 215 " 220 

Gin Leu Thr Val Leu Asn Lys Gly lie Ser Asn Ser Gly Arg Gly Ser 
25 225 230 ' 235 240 

Ser Arg Asn Lys Leu Gin Asn Phe Gin Phe Arg lie Lys Glu Asn Gly 

24 5 250 ' 25 5 

lie Asp Tyr Phe Tyr Pro Gly Cys Arg Ser 
260 265 
30 (2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

3 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..262 

40 (D) OTHER INFORMATION: / Ceres Seq. ID 1011822 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
Met Asp Arg Ser Leu Asn Leu Leu Asp Leu Ala Leu Gly Phe Asp Glu 
1 5 10 15 

Gin Leu Ala lie Pro Ser Pro Leu Asn Gly Lys Val lie Leu lie Glu 
45 20 25 30 

Asp Cys Val Glu Thr Ser Gly Ser Phe Val Leu His Gin Leu Met Lys 

35 40 45 

Arg Val Leu Ser Ser Asn Ser Ser Asp Ala Leu lie Phe Leu Ala Phe 
50 55 60 

50 Ala Arg Pro Phe Ser His Tyr Asp Arg lie Leu Arg Lys Leu Gly Cys 
65 70 75 80 

Asn Leu Ala Thr His Lys Ser Asn Asn Arg Leu Val Phe Phe Asp Met 

85 90 95 

Leu Met Val Lys Cys Ser Asp Gly Asp Gin Met Glu Asp Asn Val Ser 
55 100 ~ 105 110 

Ala Val Ala Lys Leu Phe Arg Glu lie Gin Glu Thr Val Arg Lys Leu 

115 " 120 125 

Gin Ser Val Thr Ser Gly Asn lie Thr Val Met Val Asp Asp Met Ser 
130 135 140 

60 Leu Leu Glu lie Ala Thr Thr Gly Ser Asn Ser Asp His Val Leu Asp 
145 150 155 160 

Phe Leu His Tyr Cys His Thr Leu Ser Ser Glu Ser Asn Cys Ser Leu 
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165 170 175 





Val 


He 


Leu 


Asn 
180 


His 


Glu 


Asp 


He 


•Tyr 
185 


Ala 


Ser 


Met 


Glu 


Arg 
190 


Pro 


Ala 




Phe 


Leu 


Leu 


Gin 


Met 


Val 


Cys 


Leu 


Ala 


Asp 


Val 


Val 


He 


Lys 


Ala 


Glu 


5 






195 










200 










205 










Pro 


Leu 
210 


Ala 


Ser 


Gly 


Leu 


Ala 
215 


Asn 


Asp 


Val 


His 


Gly 
220 


Gin 


Leu 


Thr 


Val 




Leu 


Asn 


Lys 


Gly 


He 


Ser 


Asn 


Ser 


Gly 


Arg 


Gly 


Ser 


Ser 


Arg 


Asn 


Lys 




225 










230 










235 










240 


10 


Leu 


Gin 


Asn 


Phe 


Gin 
245 


Phe 


Arg 


He 


Lys 


Glu 

250 


Asn 


Gly 


He 


Asp 


Tyr 

255 


Phe 



Tyr Pro Gly Cys Arg Ser 
260 

(2) INFORMATION FOR SEQ ID NO: 61: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
2 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 216 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011823 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



Met 


Lys 


Arg 


Val 


Leu 


Ser 


Ser 


Asn 


Ser 


Ser 


Asp 


Ala 


Leu 


He 


Phe 


Leu 


1 








5 










10 










15 




Ala 


Phe 


Ala 


Arg 


Pro 


Phe 


Ser 


His 


Tyr 


Asp 


Arg 


He 


Leu 


Arg 


Lys 


Leu 








20 










25 










30 






Gly 


Cys 


Asn 


Leu 


Ala 


Thr 


His 


Lys 


Ser 


Asn 


Asn 


Arg 


Leu 


Val 


Phe 


Phe 






35 










40 










45 








Asp 


Met 


Leu 


Met 


Val 


Lys 


Cys 


Ser 


Asp 


Gly Asp 


Gin 


Met 


Glu 


Asp 


Asn 




50 










55 










60 










Val 


Ser 


Ala 


Val 


Ala 


Lys 


Leu 


Phe 


Arg 


Glu 


He 


Gin 


Glu 


Thr 


Val 


Arg 


65 










70 










75 










80 


Lys 


Leu 


Gin 


Ser 


Val 


Thr 


Ser 


Gly 


Asn 


He 


Thr 


Val 


Met 


Val 


Asp 


Asp 










85 










90 










95 




Met 


Ser 


Leu 


Leu 


Glu 


He 


Ala 


Thr 


Thr 


Gly 


Ser 


Asn 


Ser 


Asp 


His 


Val 








100 










105 










110 






Leu 


Asp 


Phe 


Leu 


His 


Tyr 


Cys 


His 


Thr 


Leu 


Ser 


Ser 


Glu 


Ser 


Asn 


Cys 






115 










120 










125 








Ser 


Leu 


Val 


He 


Leu 


Asn 


His 


Glu 


Asp 


He 


Tyr 


Ala 


Ser 


Met 


Glu 


Arg 




130 










135 










140 










Pro 


Ala 


Phe 


Leu 


Leu 


Gin 


Met 


Val 


Cys 


Leu 


Ala 


Asp 


Val 


Val 


He 


Lys 


145 










150 










155 










160 


Ala 


Glu 


Pro 


Leu 


Ala 


Ser 


Gly 


Leu 


Ala 


Asn 


Asp 


Val 


His 


Gly 


Gin 


Leu 










165 










170 










175 




Thr 


Val 


Leu 


Asn 


Lys 


Gly 


He 


Ser 


Asn 


Ser 


Gly 


Arg 


Gly 


Ser 


Ser 


Arg 








180 










185 










190 






Asn 


Lys 


Leu 


Gin 


Asn 


Phe 


Gin 


Phe 


Arg 


He 


Lys 


Glu 


Asn 


Gly 


He 


Asp 






195 










200 










205 








Tyr 


Phe 


Tyr 


Pro 


Gly 


Cys 


Arg 


Ser 




















210 










215 




















(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO: 62: 

















55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 
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(B) LOCATION: 1..584 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011874 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
tcaattaaca taaccttcgc cgtcgatgaa gctttccctt caattcccac taccttcagc 60 
5 gtcgcaacga aacaacacta cgacgtaaaa ccaatccacc atgaagttta tgacggagaa 120 
aggaagatat acgacatcag ccaccagtac acgccggagt tgccggtttg ggagtcttca 180 
gaaggactag ggaactttct tagacttgcc gtgagtatga agaatggatc cgatgctaat 24 0 

atctcgaaga tggaactatc tgttcactct ggaactcatg ttgatgcacc aggccatttc 300 
catgaccatt attatgagtc tggttttgat actgattcac ttgatcttca aatccttaat 360 
10 ggtaaacatt tttggaactc tttttttttt ttttttttaa gtagctagtg atcgtttttg 420 
aaattggttt ctgtgaattt aatttatagg tcctgcttta ttggttgatg ttccaagaga 480 
taagaacatt tcaggtttgt tgagagtttt ctaacattga aatttttaca aaatcttcat 540 
taaatgtttg ttacaaaatt tttgtgtaat gatttggttg ttgt 
(2) INFORMATION FOR SEQ ID NO: 63: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
2 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..135 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011875 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Ser lie Asn lie Thr Phe Ala Val Asp Glu Ala Phe Pro Ser lie Pro 
1 5 10 15 

Thr Thr Phe Ser Val Ala Thr Lys Gin His Tyr Asp Val Lys Pro lie 
20 25 30 

30 His His Glu Val Tyr Asp Gly Glu Arg Lys lie Tyr Asp lie Ser His 
35 ' 40 45 

Gin Tyr Thr Pro Glu Leu Pro Val Trp Glu Ser Ser Glu Gly Leu Gly 

50 55 60 

Asn Phe Leu Arg Leu Ala Val Ser Met Lys Asn Gly Ser Asp Ala Asn 
35 65 70 75 ^ 80 

lie Ser Lys Met Glu Leu Ser Val His Ser Gly Thr His Val Asp Ala 

85 90 95 

Pro Gly His Phe His Asp His Tyr Tyr Glu Ser Gly Phe Asp Thr Asp 
100 105 110 

4 0 Ser Leu Asp Leu Gin lie Leu Asn Gly Lys His Phe Trp Asn Ser Phe 
115 120 125 

Phe Phe Phe Phe Leu Ser Ser 
130 135 
(2) INFORMATION FOR SEQ ID NO: 64: 
4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..62 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011981 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

acatcctaat cgaaaaaaac cctagcaaaa acctctctct ctctctcaat tcacttctcc 60 
tcctacaatg gcttggcgca acgcaggatc tgctgctcgt tctttcgtct ccgccaccgc 120 
aagatcacca tctctccgtt ctcccaccac ggcgcttcct cgcctccgtc ctcctcaatc 180 
ctccttacct agccgtcgct tcaccttctc gtcaccttcc aggaatctag gagccctagg 240 

60 ttgcacacag tcgttcttgc ctctgtacag tgttgtggct acttctcaac tgacatctca 300 
ccttaacgtt aatttgcgag ctttctgcga gctgtctaac ggtacttgaa gaggcaaaag 360 
agtgtaattt cactgcgcat gcgtgaatca gtactttgaa gggatgtgag atcgaaattg 420 
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tgggattacc tcagcaaaaa cttgcttaat gtgttgcctg ttttagaagt tttagtttca 
caatgtgtgg atttattagt tgtagagctt ttcgtatgct tggcacaatg gttttttctc 
agagaacctt tttttgtttt ggagaaatgc atagcagcat taataagtgt attggatcag 
aactgaaaca atgttgttac tgattc 
(2) INFORMATION FOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



48 
54 
60 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. . 93 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011982 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
Met Ala Trp Arg Asn Ala Gly Ser Ala Ala Arg Ser Phe Val Ser Ala 
15 10 15 

Thr Ala Arg Ser Pro Ser Leu Arg Ser Pro Thr Thr Ala Leu Pro Arg 

20 25 30 

Leu Arg Pro Pro Gin Ser Ser Leu Pro Ser Arg Arg Phe Thr Phe Ser 

35 40 45 

Ser Pro Ser Arg Asn Leu Gly Ala Leu Gly Cys Thr Gin Ser Phe Leu 

50 55 " 60 

Pro Leu Tyr Ser Val Val Ala Thr Ser Gin Leu Thr Ser His Leu Asn 
65 70 75 80 

Val Asn Leu Arg Ala Phe Cys Glu Leu Ser Asn Gly Thr 

85 90 
(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1 . .75 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011983 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
lie Leu lie Glu Lys Asn Pro Ser Lys Asn Leu Ser Leu Ser Leu Asn 
1 5 10 15 

Ser Leu Leu Leu Leu Gin Trp Leu Gly Ala Thr Gin Asp Leu Leu Leu 

20 25 30 

Val Leu Ser Ser Pro Pro Pro Gin Asp His His Leu Ser Val Leu Pro 

35 40 45 

Pro Arg Arg Phe Leu Ala Ser Val Leu Leu Asn Pro Pro Tyr Leu Ala 

50 55 60 

Val Ala Ser Pro Ser Arg His Leu Pro Gly lie 



65 70 
(2) INFORMATION FOR SEQ ID NO: 67: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY 

(B) LOCATION 



75 



(xi) 



peptide 
1 . . 46 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011984 
SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
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Met Cys Gly Phe lie Ser Cys Arg Ala Phe Arg Met Leu Gly Thr Met 

1 5 10 15 

Val Phe Ser Gin Arg Thr Phe Phe Cys Phe Gly Glu Met His Ser Ser 

20 25 30 

lie Asn Lys Cys lie Gly Ser Glu Leu Lys Gin Cys Cys Tyr 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..4 68 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014547 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



atagaatata accatattcc ggacatgaag atcgtgacat tggtactcgt cgtcttcgtc 6 

atactttcga catcattccc ggctgccatc aaagccgaag acacgggaga tacaggaaat 12 

gtgggagtga catgtgacgc aaggcagctt cagccttgcc ttgccgcgat tacgggagga 18 

ggacaaccct cgggtgcatg ttgtgcaaag cttacagagc aacagtcatg cctatgtggt 24 

ttcgctaaga accctgcgtt cgcacagtac attagctctc caaacgctcg caaagtgctc 30 

cttgcttgca atgttgctta tcccacttgt tgaaacttat ctagatttta taaataaata 36 

aacgaaagga aataaattac attatatcaa acgttatgat acaattcaac cgtttgtgtt 42 



aatgtactgg cttacatggt taaataaagt ttaatttctt ggttgctc 
(2) INFORMATION FOR SEQ ID NO: 69: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..110 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014548 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



He 


Glu 


Tyr 


Asn 


His 


He 


Pro 


Asp 


Met 


Lys 


He 


Val 


Thr 


Leu 


Val 


Leu 


1 








5 










10 










15 




Val 


Val 


Phe 


Val 


He 


Leu 


Ser 


Thr 


Ser 


Phe 


Pro 


Ala 


Ala 


He 


Lys 


Ala 








20 










25 










30 






Glu 


Asp 


Thr 


Gly 


Asp 


Thr 


Gly 


Asn 


Val 


Gly 


Val 


Thr 


Cys 


Asp 


Ala 


Arg 






35 










40 










45 








Gin 


Leu 


Gin 


Pro 


Cys 


Leu 


Ala 


Ala 


He 


Thr 


Gly 


Gly 


Gly 


Gin 


Pro 


Ser 




50 










55 










60 










Gly 


Ala 


Cys 


Cys 


Ala 


Lys 


Leu 


Thr 


Glu 


Gin 


Gin 


Ser 


Cys 


Leu 


Cys 


Gly 


65 










70 










75 










80 


Phe 


Ala 


Lys 


Asn 


Pro 


Ala 


Phe 


Ala 


Gin 


Tyr 


He 


Ser 


Ser 


Pro 


Asn 


Ala 










85 










90 










95 




Arg 


Lys 


Val 


Leu 


Leu 


Ala 


Cys 


Asn 


Val 


Ala 


Tyr 


Pro 


Thr 


Cys 












100 










105 










110 






(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO : 7 0 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 102 
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(D) OTHER INFORMATION: / Ceres Seq. ID 1014549 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 





ixie l 


Lys 


lie 


XT-. 1 

v a i 


inr 


Leu 


XT-. "1 

v a i 


Leu 


vai 


v a i 


"D V-i 

rne 


vai 


lie 


Leu 


Q a. V- 

oe r 


l nr 




1 








5 










10 










15 




o 


Ser 


fne 


Pro 


ai a 

20 


7\ 1 _ 

Ala 


T I - 

lie 


Lys 


Ala 


CjlU 

25 


Asp 


l nr 


biy 


Asp 


l nr 

30 


uiy 


Asn 




XT — T 

val 




val 
35 


lnr 


Cys 


Asp 


Ala 


Arg 
40 


Gin 


Leu 


Oln 


Pro 


Cys 

4 O 


Leu 


Ala 


Ala 




He 


Thr 


Gly 


Gly 


Gly 


Gin 


Pro 


Ser 


Gly 


Ala 


Cys 


Cys 


Ala 


Lys 


Leu 


Thr 


10 




50 










55 










60 












Glu 


Gin 


Gin 


Ser 


Cys 


Leu 


Cys 


Gly 


Phe 


Ala 


Lys 


Asn 


Pro 


Ala 


Phe 


Ala 




65 










70 










75 










80 




Gin 


Tyr 


He 


Ser 


Ser 
85 


Pro 


Asn 


Ala 


Arg 


Lys 

90 


Val 


Leu 


Leu 


Ala 


Cys 
95 


Asn 


15 


Val 


Ala 


Tyr 


Pro 
100 


Thr 


Cys 
























(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 71 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1303 base pairs 
20 <B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
2 5 (A) NAME /KEY : - 

(B) LOCATION: 1..1303 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014995 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
aaaactcaag tttcttgcaa ttacctactt acaacaaagc aactcgactc gaacaaacac 60 

30 atccatggct ccttctgcgc aacctcttcc tgtgagtgtt tcggatgaaa aatatgcgaa 120 
tgtcaagtgg gaagagttgg cattcaagtt tgttcgtacg gattatatgt atgttgcgaa 180 
gtgcaatcat ggagagagtt ttcaagaggg gaagattctt ccttttgctg atttgcaact 240 
taacccttgc gctgctgttc ttcagtatgg ccagggttta tatgaaggac tgaaagctta 300 
caggacagaa gatggtcgga ttctgctatt ccgaccagac caaaacggtc tccgccttca 360 

35 agccggagct gacagactct atatgcctta tccttcggtc gatcaattcg tctccgccat 420 
caaacaagtt gctcttgcca acaagaaatg gattcctcct ccggggaaag gaacattgta 480 
tattaggcct atcttgtttg ggagtggtcc gattcttggt tcatttccca ttcctgagac 540 
caccttcaca gcttttgcct gtcctgttgg acgttatcat aaggataact ctggtttgaa 600 
tctgaaaatc gaagatcagt ttcgtcgagc ttttcctagt ggaactggtg gtgtgaagag 660 

40 catcacaaac tattgtcctg tttggatacc attggcagag gcgaaaaaac aaggtttctc 720 
tgatattttg tttttggatg ctgcaactgg caaaaacatt gaagaacttt tcgcagctaa 780 
tgtttttatg ctcaagggca atgttgtatc gacaccaaca attgcaggaa ctattttgcc 840 
cggagtcact cgaaactgcg taatggaatt gtgtcgtgat ttcggctacc aggtcgagga 900 
acgtacgatt cctctagtgg actttctcga tgcggacgaa gctttctgta ctggcactgc 960 

45 ttccattgtg actagtattg catccgtaac ctttaaagac aaaaagaccg gattcaaaac 1020 
agggaagaaa cattggctgc gaagctatac gagacgttaa gtgatatcca gacgggtcgg 1080 
gtcgaggata ccaagggatg gacggtggag attgaccgcc agggctgaaa gttgaaactg 1140 
taacttgatg ctaaatatgt gtgtgtgtgt atacaaaact tatgtaagaa acatctgaag 1200 
atgtctctga tctttgtgat tgtgatgatc atgcatgtcc tatcgttgcg atgtatttta 1260 

50 taaatgttcg tctgtaagtt atttaataac tatggctttt tgc 
(2) INFORMATION FOR SEQ ID NO : 7 2 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 amino acids 

(B) TYPE: amino acid 
55 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
60 (B) LOCATION: 1..352 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014996 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
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Lys Leu Lys Phe Leu Ala lie Thr Tyr Leu Gin Gin Ser Asn Ser Thr 
1 5 ' 10 15 

Arg Thr Asn Thr Ser Met Ala Pro Ser Ala Gin Pro Leu Pro Val Ser 
20 25 30 

5 Val Ser Asp Glu Lys Tyr Ala Asn Val Lys Trp Glu Glu Leu Ala Phe 
35 40 45 

Lys Phe Val Arg Thr Asp Tyr Met Tyr Val Ala Lys Cys Asn His Gly 

50 55 60 

Glu Ser Phe Gin Glu Gly Lys lie Leu Pro Phe Ala Asp Leu Gin Leu 
10 65 70 75 80 

Asn Pro Cys Ala Ala Val Leu Gin Tyr Gly Gin Gly Leu Tyr Glu Gly 

8 5 90 95 

Leu Lys Ala Tyr Arg Thr Glu Asp Gly Arg lie Leu Leu Phe Arg Pro 
100 105 110 

15 Asp Gin Asn Gly Leu Arg Leu Gin Ala Gly Ala Asp Arg Leu Tyr Met 
115 120 125 

Pro Tyr Pro Ser Val Asp Gin Phe Val Ser Ala lie Lys Gin Val Ala 

130 135 140 

Leu Ala Asn Lys Lys Trp lie Pro Pro Pro Gly Lys Gly Thr Leu Tyr 
20 145 150 155 160 

lie Arg Pro lie Leu Phe Gly Ser Gly Pro lie Leu Gly Ser Phe Pro 

165 170 175 

lie Pro Glu Thr Thr Phe Thr Ala Phe Ala Cys Pro Val Gly Arg Tyr 
180 185 190 

25 His Lys Asp Asn Ser Gly Leu Asn Leu Lys lie Glu Asp Gin Phe Arg 
195 200 205 

Arg Ala Phe Pro Ser Gly Thr Gly Gly Val Lys Ser lie Thr Asn Tyr 

210 215 220 

Cys Pro Val Trp lie Pro Leu Ala Glu Ala Lys Lys Gin Gly Phe Ser 
30 225 230 235 ' 240 

Asp lie Leu Phe Leu Asp Ala Ala Thr Gly Lys Asn lie Glu Glu Leu 

245 250 255 

Phe Ala Ala Asn Val Phe Met Leu Lys Gly Asn Val Val Ser Thr Pro 
260 265 270 

35 Thr lie Ala Gly Thr lie Leu Pro Gly Val Thr Arg Asn Cys Val Met 
275 280 285 

Glu Leu Cys Arg Asp Phe Gly Tyr Gin Val Glu Glu Arg Thr lie Pro 

290 ~ ~ 295 300 

Leu Val Asp Phe Leu Asp Ala Asp Glu Ala Phe Cys Thr Gly Thr Ala 
40 305 310 315 320 

Ser lie Val Thr Ser lie Ala Ser Val Thr Phe Lys Asp Lys Lys Thr 

325 330 335 

Gly Phe Lys Thr Gly Lys Lys His Trp Leu Arg Ser Tyr Thr Arg Arg 
340 345 350 

4 5 (2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..331 

55 (D) OTHER INFORMATION: / Ceres Seq. ID 1014997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 3 : 
Met Ala Pro Ser Ala Gin Pro Leu Pro Val Ser Val Ser Asp Glu Lys 
15 10 15 

Tyr Ala Asn Val Lys Trp Glu Glu Leu Ala Phe Lys Phe Val Arg Thr 
60 " 20 25 30 

Asp Tyr Met Tyr Val Ala Lys Cys Asn His Gly Glu Ser Phe Gin Glu 
35 4 0 "45 



WO 00/40695 



PCT/US00/00466 



360 



Gly 


Lys 


He 


Leu 


Pro 


Phe 


Ala 


7\ 0 T-~i 
Hop 


Leu 


Gin 


Leu 


Asn 


Pro 


Cys 


Ala 


Ala 




50 










55 










60 










Val 


Leu. 


Gin 


Tyr 


Gly 


Gin 


Gly 


Leu 


T\/T 

1 yx 


Glu 


Gly 


Leu 


Lys 


Ala 


1 y x 


Arg 


65 










70 










75 










80 


Th r 




Hop 


Gly Arg 


He 


Leu 


Leu 


pho 


7\ v-pt 
r\J- y 


P T O 


-tiO p 




As n 


Gly 


T ,0 1 T 
J-iC n 










85 




















95 




Arg 


T ,pi i 
JjCU 


Gin 


Ala 


Gly Ala Asp 


r\±. y 


LGU 


1 yr 


Met 


Pro 


1 yx 


Pro 


Ser 


Val 








100 










105 










110 






Hop 


IjXI l 


pk 0 


Val 


Ser 


Ala 


He 


Lys 


bin 


v a x 


Hxa 


Leu 


HX Cl 


A c: m 
Ho 1 1 


xi y 0 


L ys 






1 1 R 

X X J 










ion 

x z u 










1 7 R 

X ^ 








i rp 


T 1 0 

lie 


riO 


Pro 


Pro 


Gly 


Lys 


biy 


Th r 


Leu 


lyr 


lie 


hx g 




T 1 0 
lie 


Leu 




130 










135 










14 0 










IT IlC 




OCX 


Gly 


Pro 


He 


Leu 


uiy 


0 X 




Prn 
IT X 


T 1 0 


Prn 

IT X Vj 


Pin 

UlU 


Th r~ 
1 1 1 X 


Th t 

X I IX 


1 4 R 

x ^ j 










150 










1 R R 

1 j j 










1 

1DU 


it I lsr 


Th -r 


Al ;=i 

HX Cl 


Phe 


Ala 


Cys 


Pro 


Vdl 


flu 

uiy 


hx y 


1 yr 


n. x 0 


Lys 


A cn 
Ho p 


rib 1 1 












165 










x / u 










1 7 R 

X / J 






JjSU 


A <=: m 


Leu 


Lys 


He 


Glu 


Ho p 


Ulil 


Pho 


A yrr 
hx y 


A rrr 
hx y 


HX Cl 


Phe 


IT X t-J 


OCX 








180 










1 

X O J 










1 90 

X Z? KJ 






uiy 


T h y* 


uiy 


Gly Val 


Lys 


Ser 


Tip 
lie 


X 111 


7\ cn 
Holl 


T wr 

1 yr 


^ y 0 


P -y-r\ 
tr JL \J 


Vpj 1 
v a. x 


irp 


T 1 0 
lie 






195 










? n 0 

z u u 










20 5 








D -K- 


Leu 


Hid 


Glu 


Ala 


Lys 


Lys 


r 1 r\ 
tjin 


C 1 \7 

uiy 


Ph a 


oex 


Asp 


x x e 


Leu 


Ph 0 

IT 1 1ST 


Leu 




01 n 

Z. X u 










215 










z z u 










nsp 


Hid 


hx a 


Thr 


Gly 


Lys 


Asn 


lie 


Pin 
bill 


ulU 


Leu 


Ph 0 


Hxa 


Ala 
Hid 


Holl 


v ai 


225 










230 










? ^ R 

Z J J 










24 0 


ir ne 


L v iex. 


Leu 


Lys 


Gly Asn 


Val 


v ai 


Q Q v- 

oer 


Th v 

1 nr 


Pro 


Th r* 


T 1 0 

x xe 


Ala 

Hia 


r 1 \7 

uiy 


Th r* 
1 Ilx 










245 










? RH 
Z J u 










? RR 

z _> 




T 1 o 


Lgu 


riu 


Gly 


Val 


Thr 


Arg 


7\ 0 r\ 
Holl 


L»ys 


Vdl 


Moi- 
1X1 L- 


Cl 1 n 

bill 


Leu 




.M.X y 


Acn 
Ho p 








260 










265 










270 






Phe 


Gly 


Tyr 


Gin 


Val 


Glu 


Glu 


Arg 


Thr 


He 


Pro 


Leu 


Val 


Asp 


Phe 


Leu 






275 










280 










285 








Asp 


Ala 


Asp 


Glu 


Ala 


Phe 


Cys 


Thr 


Gly 


Thr 


Ala 


Ser 


He 


Val 


Thr 


Ser 




290 










295 










300 










He 


Ala 


Ser 


Val 


Thr 


Phe 


Lys 


Asp 


Lys 


Lys 


Thr 


Gly 


Phe 


Lys 


Thr 


Gly 


305 










310 










315 










320 


Lys 


Lys 


His 


Trp 


Leu 


Arg 


Ser 


Tyr 


Thr 


Arg 


Arg 













325 330 



(2) INFORMATION FOR SEQ ID NO : 7 4 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..2 97 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014998 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 



Met 


Tyr 


Val 


Ala 


Lys 


Cys 


Asn 


His 


Gly 


Glu 


Ser 


Phe 


Gin 


Glu 


Gly 


Lys 


1 








5 










10 










15 




He 


Leu 


Pro 


Phe 


Ala 


Asp 


Leu 


Gin 


Leu 


Asn 


Pro 


Cys 


Ala 


Ala 


Val 


Leu 








20 










25 










30 






Gin 


Tyr 


Gly 


Gin 


Gly 


Leu 


Tyr 


Glu 


Gly 


Leu 


Lys 


Ala 


Tyr 


Arg 


Thr 


Glu 






35 










40 










45 








Asp 


Gly 


Arg 


He 


Leu 


Leu 


Phe 


Arg 


Pro 


Asp 


Gin 


Asn 


Gly 


Leu 


Arg 


Leu 




50 










55 










60 










Gin 


Ala 


Gly 


Ala 


Asp 


Arg 


Leu 


Tyr 


Met 


Pro 


Tyr 


Pro 


Ser 


Val 


Asp 


Gin 


65 










70 










75 










80 


Phe 


Val 


Ser 


Ala 


He 


Lys 


Gin 


Val 


Ala 


Leu 


Ala 


Asn 


Lys 


Lys 


Trp 


He 










85 










90 










95 




Pro 


Pro 


Pro 


Gly 


Lys 


Gly 


Thr 


Leu 


Tyr 


He 


Arg 


Pro 


He 


Leu 


Phe 


Gly 



100 105 110 
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Ser 


Gly 


Pro 


lie 


Leu 


Gly 


Ser 


Phe 


Pro 


He 


Pro 


Glu 


Thr 


Thr 


Phe 


Thr 






1 1 c 

llj 










1 ZD 










1 Z 0 








Ala 


Fne 


7\ T 

Ala 


Cys 


Pro 


val 


bly 


Arg 


Tyr 


His 


Lys 


Asp 


Asn 


Ser 


uiy 


Leu 




u u 










1-50 










1 4 U 










Asn 


Leu 


Lys 


lie 


Glu 


Asp 


Gin 


Phe 


Arg 


Arg 


Ala 


Phe 


Pro 


Ser 


bly 


i nr 












lou 










loo 










1 bU 


Gly 


Gly 


Val 


Lys 


Ser 


He 


Thr 


Asn 


Tyr 


Cys 


Pro 


Val 


Trp 


lie 


Pro 


Leu 










1 bo 










1 / U 










I/O 




Ala 


Glu 


Ala 


Lys 


Lys 


Gin 


Gly 


Phe 


Ser 


Asp 


He 


Leu 


Phe 


Leu 


Asp 


Ala 








18 0 










18 5 










1 3 U 






Ala 


Thr 


Gly 


Lys 


Asn 


He 


Glu 


Glu 


Leu 


Phe 


Ala 


Ala 


Asn 


Val 


Phe 


Met 






195 










200 










205 








Leu 


Lys 


Gly 


Asn 


Val 


Val 


Ser 


Thr . 


Pro 


Thr 


He 


Ala 


Gly 


Thr 


He 


Leu 




9 1 C\ 

Z 1 U 










Z 1 0 










o o r\ 

220 










Pro 


Gly 


Val 


Thr 


Arg 


Asn 


Cys 


Val 


Met 


Glu 


Leu 


Cys 


Arg 


Asp 


Phe 


Gly 


ZZ J 










ZjU 










ZjO 










9 a n 

ZhV 


Tyr 


Gin 


Val 


Glu 


Glu 


Arg 


Thr 


He 


Pro 


Leu 


Val 


Asp 


Phe 


Leu 


Asp 


Ala 










245 










250 










255 




Asp 


Glu 


Ala 


Phe 


Cys 


Thr 


Gly 


Thr 


Ala 


Ser 


He 


Val 


Thr 


Ser 


He 


Ala 








260 










265 










270 






Ser 


Val 


Thr 


Phe 


Lys 


Asp 


Lys 


Lys 


Thr 


Gly 


Phe 


Lys 


Thr 


Gly 


Lys 


Lys 






275 










280 










285 








His 


Trp 


Leu 


Arg 


Ser 


Tyr 


Thr 


Arg 


Arg 

















290 295 
(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1..78 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1015323 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 5 : 



gggttaggtt tgtcttcaaa ategtcagaa gaagaagaag aagtcaaaga cgataacaat 60 

gaegtcatea tcgtacctcc gatttgegat cgccgtcgta gctttcctct caatcaccac 120 

catcaccacc gcacgtcctt gcaaaacctt cctaatctcc tcatattccc tctcaataac 180 

cccagaaaac cctaacctcg aatccgattt cacctccaca egattegtea ccgttttcac 240 

aatccgtcgt ctaaatcctc accacgtggt tcctttcttc gtcaatcgtc gtcacgagaa 300 

gccacagatc caateggate gttccctccc tttgatctct gataacatca attegtttag 360 

agatcgaacc agagatattc ttagcgtcgt cgtcgcgctt ttgttcggtg ttggttgtgg 420 

tgetttaace getgetacta tgtatcttgt ttgggctctt gttgtgaatc gtcaaagcta 480 

tgattttgag gaagaagaag atgattatga gaatgatgaa tetgatgetg ctagcttgaa 540 

gaaattgggt tatgttaaga ttccagctcc agctcctgct cctgtgaagg aagctgcttg 600 

atttgaggtt gtgaatttca gtttctgeat attatcatga tttacgttgt tgcttatcta 660 

tgttgttgcg tacaagtaaa ttcgtctctg ttatggattt atctaaactt actatactct 720 

aagtatcagt gtttttacga tttgtagttg atgtttctta tatatatata taagtgtgat 780 



tatgatgtc 

(2) INFORMATION FOR SEQ ID NO : 7 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..18 0 

(D) OTHER INFORMATION: / Ceres Seq. ID 1015324 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
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FJe t 


1 nr 


Sex" 


Ser 


Ser 


Tyr 


Leu 


Arg 


irne 


ai a 


i le 


Ala 


vai 


val 


Ala 


Fne 


1 


















1 u 










1 3 




Leu 


Ser 


lie 


i nr 


T 1 V» -y~ 

i nr 


lie 


T» K -v~ 

i nr 


rp l_ „ 

i nr 


Ala 


Arg 


Pro 


Cys 


Lys 


T» K -v~ 

i nr 


rne 


Leu 








9 n 
z u 










9 c; 










3 u 






i xe 






Tyr 


Ser 


Leu 


Ser 


lie 


i nr 


Pro 


LjlU 


Asn 


Pro 


Asn 


Leu 








J 3 










a n 










A S 
1 3 








ber 


Asp 


TDK ^ 

Fne 


i nr 


Ser 


rp v. „ 

i nr 


Arg 


rne 


val 


i nr 


v ai 


rne 


i nr 


lie 


Arg 


Arg 




3 U 










3 3 










D U 










Leu 


Asn 


Pro 


His 


His 


val 


Val 


Pro 


Phe 


Phe 


Val 


Asn 


Arg 


Arg 


His 


Glu 


D 3 










"7 n 










/ 3 










o U 


Lys 


Pro 


Gin 


I le 


Gin 


Ser 


Asp 


Arg 


Ser 


Leu 


Pro 


Leu 


He 


Ser 


Asp 


Asn 










O 3 










y u 










y 3 




lie 


Asn 


Ser 


Phe 


Arg 


Asp 


Arg 


Thr 


Arg 


Asp 


lie 


Leu 


ber 


Val 


Val 


val 








inn 
1 U U 










1 U 0 










I 1 A 

I I u 






7\ "I — 

Ala 


Leu 


Leu 


Phe 


Gly 


Val 


Gly 


Cys 


Gly 


Ala 


Leu 


Thr 


Ala 


Ala 


Thr 


Met 






I 1 c. 

II J 










1ZU 










1 oc. 
_L Z. 3 








Tyr 


Leu 


Val 


Trp 


Ala 


Leu 


Val 


Val 


Asn 


Arg 


Gin 


Ser 


Tyr 


Asp 


Phe 


Glu 




130 










135 










140 










Glu 


Glu 


Glu 


Asp 


Asp 


Tyr 


Glu 


Asn 


Asp 


Glu 


Ser 


Asp 


Ala 


Ala 


Ser 


Leu 


145 










150 










155 










160 


Lys 


Lys 


Leu 


Gly 


Tyr 


Val 


Lys 


He 


Pro 


Ala 


Pro 


Ala 


Pro 


Ala 


Pro 


Val 










165 










170 










175 




Lys 


Glu 


Ala 


Ala 



























180 



(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..495 

(D) OTHER INFORMATION: / Ceres Seq. ID 1016486 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
acttcctcat tagcattcta cagtctgcac acttatcact tctttcgaaa tctctctcta 
tccctctcaa aaatgaagct ctctgtgcgt tttatctccg ctgctcttct cttgttcatg 



gtattcattg ccacagggat gggtccagtc accgtggagg cacgcacgtg tgagtcaaag 18 

agccataggt tcaagggtcc atgtgtgagc acacacaact gtgcaaacgt gtgccacaac 24 

gaaggcttcg gcggaggtaa atgccgtgga ttccgtcgtc gttgctactg cacaagacac 30 

tgctgatcca tccattctca tgactcaaat cttcgatcca tcgtcagtgt gttacttctt 36 

tcttatctaa atcttccgta cggtaccata tcgtaccgta catgagtgtt ttctcaaata 42 

agtcgttggt ttgtgtgttt ccggttttaa tgtaatgtta aatcaattaa tggcttttaa 48 



tatattgtat tatgg 

(2) INFORMATION FOR SEQ ID NO : 7 8 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 101 

(D) OTHER INFORMATION: / Ceres Seq. ID 1016487 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
Thr Ser Ser Leu Ala Phe Tyr Ser Leu His Thr Tyr His Phe Phe Arg 
15 10 15 

Asn Leu Ser Leu Ser Leu Ser Lys Met Lys Leu Ser Val Arg Phe lie 

20 25 30 

Ser Ala Ala Leu Leu Leu Phe Met Val Phe lie Ala Thr Gly Met Gly 
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35 40 45 

Pro Val Thr Val Glu Ala Arg Thr Cys Glu Ser Lys Ser His Arg Phe 

50 55 60 

Lys Gly Pro Cys Val Ser Thr His Asn Cys Ala Asn Val Cys His Asn 
65 70 75 80 

Glu Gly Phe Gly Gly Gly Lys Cys Arg Gly Phe Arg Arg Arg Cys Tyr 

85 " 90 95 

Cys Thr Arg His Cys 
100 

(2) INFORMATION FOR SEQ ID NO : 7 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .77 

(D) OTHER INFORMATION: / Ceres Seq. ID 1016488 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Met 


Lys 


Leu 


Ser 


Val 


Arg 


Phe 


He 


Ser 


Ala 


Ala 


Leu 


Leu 


Leu 


Phe 


Met 


1 








5 










10 










15 




Val 


Phe 


He 


Ala 


Thr 


Gly 


Met 


Gly 


Pro 


Val 


Thr 


Val 


Glu 


Ala 


Arg 


Thr 








20 










25 










30 






Cys 


Glu 


Ser 


Lys 


Ser 


His 


Arg 


Phe 


Lys 


Gly 


Pro 


Cys 


Val 


Ser 


Thr 


His 






35 










40 










45 








Asn 


Cys 


Ala 


Asn 


Val 


Cys 


His 


Asn 


Glu 


Gly 


Phe 


Gly 


Gly 


Gly 


Lys 


Cys 




50 










55 










60 










Arg 


Gly 


Phe 


Arg 


Arg 


Arg 


Cys 


Tyr 


Cys 


Thr 


Arg 


His 


Cys 








65 










70 










75 












(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 80: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 62 

(D) OTHER INFORMATION: / Ceres Seq. ID 1016489 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
Met Val Phe He Ala Thr Gly Met Gly Pro Val Thr Val Glu Ala Arg 
15 10 15 

Thr Cys Glu Ser Lys Ser His Arg Phe Lys Gly Pro Cys Val Ser Thr 

20 25 30 

His Asn Cys Ala Asn Val Cys His Asn Glu Gly Phe Gly Gly Gly Lys 

35 40 45 

Cys Arg Gly Phe Arg Arg Arg Cys Tyr Cys Thr Arg His Cys 

50 55 60 

(2) INFORMATION FOR SEQ ID NO : 8 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..325 

(D) OTHER INFORMATION: / Ceres Seq. ID 1018341 



WO 00/40695 



PC17US00/00466 



364 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
atcatcaaac acaaaacaat caatcaaaaa cataaaattc aaagtgtaag caaaaccaga 60 
acgagagaaa atggacaaca agcaaaacgc gagctaccaa gccggtcaag ccactggcca 120 
gactaaggag aaggccggtg gaatgatgga caaggccaag gatgctgctg cttcagctca 180 
5 agactccttg caacagactg gacaacaaat gaaggagaag gcgcaaggag ctgctgatgt 240 
cgtcaaggac aagaccggca tgaacaaaag ccactaagca agcatcttgg acttcttttc 300 
taattaattc ccctcttcga cttgt 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

15 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 68 

(D) OTHER INFORMATION: / Ceres Seq. ID 1018342 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
20 Met Asp Asn Lys Gin Asn Ala Ser Tyr Gin Ala Gly Gin Ala Thr Gly 
15 10 15 

Gin Thr Lys Glu Lys Ala Gly Gly Met Met Asp Lys Ala Lys Asp Ala 

20 25 30 

Ala Ala Ser Ala Gin Asp Ser Leu Gin Gin Thr Gly Gin Gin Met Lys 
25 35 40 45 

Glu Lys Ala Gin Gly Ala Ala Asp Val Val Lys Asp Lys Thr Gly Met 

50 55 60 

Asn Lys Ser His 
65 

30 (2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 44 

40 (D) OTHER INFORMATION: / Ceres Seq. ID 1018343 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
Met Met Asp Lys Ala Lys Asp Ala Ala Ala Ser Ala Gin Asp Ser Leu 
1 " 5 10 15 

Gin Gin Thr Gly Gin Gin Met Lys Glu Lys Ala Gin Gly Ala Ala Asp 
45 20 25 30 

Val Val Lys Asp Lys Thr Gly Met Asn Lys Ser His 

35 ^ 40 

(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
55 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..4 3 

(D) OTHER INFORMATION: / Ceres Seq. ID 1018344 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
60 Met Asp Lys Ala Lys Asp Ala Ala Ala Ser Ala Gin Asp Ser Leu Gin 
1 J 5 10 15 

Gin Thr Gly Gin Gin Met Lys Glu Lys Ala Gin Gly Ala Ala Asp Val 
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20 25 30 

Val Lys Asp Lys Thr Gly Met Asn Lys Ser His 

35 ~ 4 0 

(2) INFORMATION FOR SEQ ID NO: 85: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .540 

(D) OTHER INFORMATION: / Ceres Seq. ID 1018382 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
taaaagcaat aaaaaacaaa aatggcgttt actccgaaga tcatcacatg cctcattgtc 



cttacgatct acatgacatc cccaacagag tcaaccatcc agtgtgggac agtgacgagc 1 

acactggcac agtgcgtgac ctacttgacc aacagtggtc cattgccatc acaatgctgc 1 

gtgggagtca agtcattgta ccaattggct cagaccacac cggaccgtaa acaagtatgt 2 

gagtgcctta aactagcggg taaagaaatc aagggcctca acaccgacct tgtggccgca 3 

cttcctacca cttgtggtgt ttcaattccc taccccatca gttttagcac caattgcgac 3 

agtatatcga ctgccgtgtg aaagaggcta gtgatcagat gtacgactaa tcaaacttgc 4 

cagcttttaa cctaattaaa taaaagtatt ctgcttatat ttcccatttt atgattttat 4 



cttcttatct atgtaaccac acgatttcat atgctaataa tgacaacgga tctttctctc 
(2) INFORMATION FOR SEQ ID NO : 8 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 119 

(D) OTHER INFORMATION: / Ceres Seq. ID 1018383 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



Met 


Ala 


Phe 


Thr 


Pro 


Lys 


He 


He 


Thr 


Cys 


Leu 


He 


Val 


Leu 


Thr 


He 


1 








5 










10 










15 




Tyr 


Met 


Thr 


Ser 


Pro 


Thr 


Glu 


Ser 


Thr 


He 


Gin 


Cys 


Gly 


Thr 


Val 


Thr 








20 










25 










30 






Ser 


Thr 


Leu 


Ala 


Gin 


Cys 


Val 


Thr 


Tyr 


Leu 


Thr 


Asn 


Ser 


Gly 


Pro 


Leu 






35 










40 










45 








Pro 


Ser 


Gin 


Cys 


Cys 


Val 


Gly 


Val 


Lys 


Ser 


Leu 


Tyr 


Gin 


Leu 


Ala 


Gin 




50 










55 










60 










Thr 


Thr 


Pro 


Asp 


Arg 


Lys 


Gin 


Val 


Cys 


Glu 


Cys 


Leu 


Lys 


Leu 


Ala 


Gly 


65 










70 










75 










80 


Lys 


Glu 


He 


Lys 


Gly 


Leu 


Asn 


Thr 


Asp 


Leu 


Val 


Ala 


Ala 


Leu 


Pro 


Thr 










85 










90 










95 




Thr 


Cys 


Gly 


Val 


Ser 


He 


Pro 


Tyr 


Pro 


He 


Ser 


Phe 


Ser 


Thr 


Asn 


Cys 








100 










105 










110 






Asp 


Ser 


He 


Ser 


Thr 


Ala 


Val 
























115 




























(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO: 87 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 102 
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25 



30 



35 



40 



45 



50 



55 



60 
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(D) OTHER INFORMATION: / Ceres Seq. ID 1018384 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
Met Thr Ser Pro Thr Glu Ser Thr lie Gin Cys Gly Thr Val Thr Ser 
1 5 10 15 

Thr Leu Ala Gin Cys Val Thr Tyr Leu Thr Asn Ser Gly Pro Leu Pro 

20 " 25 30 

Ser Gin Cys Cys Val Gly Val Lys Ser Leu Tyr Gin Leu Ala Gin Thr 

35 40 ' 45 

Thr Pro Asp Arg Lys Gin Val Cys Glu Cys Leu Lys Leu Ala Gly Lys 

50 55 " 60 

Glu lie Lys Gly Leu Asn Thr Asp Leu Val Ala Ala Leu Pro Thr Thr 
65 70 75 80 

Cys Gly Val Ser lie Pro Tyr Pro lie Ser Phe Ser Thr Asn Cys Asp 

85 " 90 95 

Ser lie Ser Thr Ala Val 
100 

(2) INFORMATION FOR SEQ ID NO : 8 8 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



(xi) 



peptide 
1. .75 

(D) OTHER INFORMATION: / Ceres Seq. ID 1018385 
SEQUENCE DESCRIPTION: SEQ ID NO : 8 8 : 
Lys Ala lie Lys Asn Lys Asn Gly Val Tyr Ser Glu Asp His His Met 
1 5 " 10 15 

Pro His Cys Pro Tyr Asp Leu His Asp lie Pro Asn Arg Val Asn His 

20 25 30 

Pro Val Trp Asp Ser Asp Glu His Thr Gly Thr Val Arg Asp Leu Leu 

35 40 45 

Asp Gin Gin Trp Ser lie Ala lie Thr Met Leu Arg Gly Ser Gin Val 

50 55 60 

lie Val Pro lie Gly Ser Asp His Thr Gly Pro 

' 70 ^ ^75 

INFORMATION FOR SEQ ID NO : 8 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
DNA (genomic) 



65 
(2) 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



1. .844 

(D) OTHER INFORMATION: / Ceres Seq. ID 1020666 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 9 : 

ctaaatattt aaagctttcc ttcaccagca ttgtattgcc tccctggaac accgaagctc 60 

tggattaacc tattttcacc ttctaaggtt cttcgtcgtt cttattgatc tgtgaagatg 120 

tattcgggtc agaacaagat ccacaaggat aagggtgttg caccaacaga gtttgaagag 180 

caagttactc aggctttgtt tgacttggag aacaccaacc aggagttgaa aagcgagttg 240 

aaagatctct acattaacca agctgttcag atggatattt ctggcaaccg caaagctgtt 300 

gtgatctacg ttccattcag gctgaggaaa gctttccgca agattcatct tcgtctggtc 360 

agagagcttg agaagaagtt cagtggaaaa gatgtgatct ttgttgctac cagaagaatc 420 

atgcgcccac caaagaaagg ctcagctgtt cagagaccac gcaacagaac tcttacttct 480 

gtccatgaag ccatgctcga ggatgtcgct taccctgctg agattgtagg aaagagaacc 540 

agataccgtc ttgatggcac caagatcatg aaggtatttt tggatagtaa gctaaagaac 600 

gacactgagt acaagctgga gacaatggtg ggtgtgtaca gaaaacttac agggaaagat 660 

gtagttttcg agtacccagt catagaagct tgaagagatg aatgtttgtt tcattctttt 720 
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ttgtcggata gagagctttt gattctgttt gggagtaaaa ctctttgcaa agattttgtt 78 
atttcagcat ttcgcaatct cttcttcaaa gttcgattta tcgatacaat tcttttttcc 84 
tacc 

(2) INFORMATION FOR SEQ ID NO: 90: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..191 

<D) OTHER INFORMATION: / Ceres Seq. ID 1020667 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



Met 


Tvr 


Ser 


Gly 


Gin 


Asn 


Lys 


He 


His 


Lys 


Asp 


Lys 


Gly 


Val 


Ala 


Pro 


1 








5 










10 










15 




Thr 


Glu 


Phe 


Glu 


Glu 


Gin 


Val 


Thr 


Gin 


Ala 


Leu 


Phe 


Asp 


Leu 


Glu 


Asn 








20 










25 










30 






Thr 


Asn 


Gin 


Glu 


Leu 


Lys 


Ser 


Glu 


Leu 


Lys 


Asp 


Leu 


Tyr 


He 


Asn 


Gin 






35 










40 










45 








Ala 


Val 


Gin 


Met 


Asp 


He 


Ser 


Gly 


Asn 


Arg 


Lys 


Ala 


Val 


Val 


He 


Tyr 




50 










55 










60 










Val 


Pro 


Phe 


Arg 


Leu 


Arg 


Lys 


Ala 


Phe 


Arg 


Lys 


He 


His 


Leu 


Arg 


Leu 


65 










70 










75 










80 


Val 


Arg 


Glu 


Leu 


Glu 


Lys 


Lys 


Phe 


Ser 


Gly 


Lys 


Asp 


Val 


He 


Phe 


Val 










85 










90 










95 




Ala 


Thr 


Arg 


Arg 


He 


Met 


Arg 


Pro 


Pro 


Lys 


Lys 


Gly 


Ser 


Ala 


Val 


Gin 








100 










105 










110 






Arg 


Pro 


Arg 


Asn 


Arg 


Thr 


Leu 


Thr 


Ser 


Val 


His 


Glu 


Ala 


Met 


Leu 


Glu 






115 










120 










125 








Asp 


Val 


Ala 


Tyr 


Pro 


Ala 


Glu 


He 


Val 


Gly 


Lys 


Arg 


Thr 


Arg 


Tyr 


Arg 




130 










135 










140 










Leu 


Asp 


Gly 


Thr 


Lys 


He 


Met 


Lys 


Val 


Phe 


Leu 


Asp 


Ser 


Lys 


Leu 


Lys 


145 










150 










155 










160 


Asn 


Asp 


Thr 


Glu 


Tyr 


Lys 


Leu 


Glu 


Thr 


Met 


Val 


Gly 


Val 


Tyr 


Arg 


Lys 










165 










170 










175 




Leu 


Thr 


Gly 


Lys 


Asp 


Val 


Val 


Phe 


Glu 


Tyr 


Pro 


Val 


He 


Glu 


Ala 





180 185 190 



(2) INFORMATION FOR SEQ ID NO: 91: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 



(A) NAME /KEY : peptide 

(B) LOCATION: 1..140 

(D) OTHER INFORMATION: / Ceres Seq. ID 1020668 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 91: 












Met 


Asp 


He 


Ser 


Gly 


Asn 


Arg 


Lys 


Ala 


Val 


Val 


He 


Tyr 


Val 


Pro 


Phe 


1 








5 










10 










15 




Arg 


Leu 


Arg 


Lys 


Ala 


Phe 


Arg 


Lys 


He 


His 


Leu 


Arg 


Leu 


Val 


Arg 


Glu 








20 










25 










30 






Leu 


Glu 


Lys 


Lys 


Phe 


Ser 


Gly 


Lys 


Asp 


Val 


He 


Phe 


Val 


Ala 


Thr 


Arg 






35 










40 










45 








Arg 


He 


Met 


Arg 


Pro 


Pro 


Lys 


Lys 


Gly 


Ser 


Ala 


Val 


Gin 


Arg 


Pro 


Arg 




50 










55 










60 










Asn 


Arg 


Thr 


Leu 


Thr 


Ser 


Val 


His 


Glu 


Ala 


Met 


Leu 


Glu 


Asp 


Val 


Ala 


65 










70 










75 










80 


Tyr 


Pro 


Ala 


Glu 


He 


Val 


Gly 


Lys 


Arg 


Thr 


Arg 


Tyr 


Arg 


Leu 


Asp 


Gly 
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(ii) 
(ix) 



(xi) 



368 

85 90 95 

Thr Lys lie Met Lys Val Phe Leu Asp Ser Lys Leu Lys Asn Asp Thr 

100 * 105 ' 110 

Glu Tyr Lys Leu Glu Thr Met Val Gly Val Tyr Arg Lys Leu Thr Gly 

115 120 125 

Lys Asp Val Val Phe Glu Tyr Pro Val lie Glu Ala 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..78 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1020784 
SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
agggttaggt ttgtcttcaa aatcgtcaga agaagaagaa gaagtcaaag acgataacaa 
tgacgtcatc atcgtacctc cgatttgcga tcgccgtcgt agctttcctc tcaatcacca 
ccatcaccac cgcacgtcct tgcaaaacct tcctaatctc ctcatattcc ctctcaataa 
ccccagaaaa ccctaacctc gaatccgatt tcacctccac acgattcatc accgttttca 
caatccgtcg tctaaatcct caccacgtgg ttcctttctt cgtcaatcgt cgtcacgaga 
agccacagat ccaatcggat cgttccctcc ctttgatctc tgataacatc aattcgttta 
gagatcgaac cagagatatt cttagcgtcg tcgtcgcgct tttgttcggt gttggttgtg 
gtgctttaac cgctgctact atgtatcttg tttgggctct tgttgtgaat cgtcaaagct 
atgattttga ggaagaagaa gatgattatg agaatgatga atctgatgct gctagcttga 
agaaattggg ttatgttaag attccagctc cagctcctgc tcctgtgaag gaagctgctt 
gatttgaggt tgtgaatttc agtttctgca tattatcatg atttacgttg ttgcttatct 
atgttgttgc gtacaagtaa attcgtctct gttatggatt tatctaaact tactatactc 
taagtatcag tgtttttacg atttgtagtt gatgtttctt atatatatat ataagtgtga 
ttatgatgc 

(2) INFORMATION FOR SEQ ID NO: 93: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. .180 

(D) OTHER INFORMATION: / Ceres Seq. ID 1020785 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
Met Thr Ser Ser Ser Tyr Leu Arg Phe Ala lie Ala Val Val Ala Phe 
1 5 ' 10 15 

Leu Ser lie Thr Thr lie Thr Thr Ala Arg Pro Cys Lys Thr Phe Leu 

20 25 30 

lie Ser Ser Tyr Ser Leu Ser lie Thr Pro Glu Asn Pro Asn Leu Glu 

35 ~ 40 45 

Ser Asp Phe Thr Ser Thr Arg Phe lie Thr Val Phe Thr lie Arg Arg 

50 55 60 

Leu Asn Pro His His Val Val Pro Phe Phe Val Asn Arg Arg His Glu 
65 70 75 80 

Lys Pro Gin lie Gin Ser Asp Arg Ser Leu Pro Leu lie Ser Asp Asn 

85 90 95 

lie Asn Ser Phe Arg Asp Arg Thr Arg Asp lie Leu Ser Val Val Val 

100 105 110 

Ala Leu Leu Phe Gly Val Gly Cys Gly Ala Leu Thr Ala Ala Thr Met 

115 120 125 

Tyr Leu Val Trp Ala Leu Val Val Asn Arg Gin Ser Tyr Asp Phe Glu 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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130 135 140 

Glu Glu Glu Asp Asp Tyr Glu Asn Asp Glu Ser Asp Ala Ala Ser Leu 
145 ~ 150 155 160 

Lys Lys Leu Gly Tyr Val Lys lie Pro Ala Pro Ala Pro Ala Pro Val 
165 170 175 

Lys Glu Ala Ala 
180 

INFORMATION FOR SEQ ID NO: 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 765 base pairs 

(B) TYPE: • nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
DNA (genomic) 



(2) 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



1.-.765 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021525 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
accctcactc acttttccgg tttcaaccat tagcacaaaa aatctgagct ccgaccaaaa 60 
attggggaac gacgatggcg acgactcttc agaaactctc ctctcaaatc caccgactct 120 
ctccattcac cagatctcta atcgttcgca catccgcaac gtctgctcct tctccgtctc 180 
tgggatcgaa gaaagtctct gaccgaatcg tcaagctttc tgcgatcgat cccgatggat 240 
acaagcagga catcattgga ctctctggcc agactctcct tcgtgcgctc actcacactg 300 
gtctgataga tccagcatcg catagattgg atgacatcga ggcttgctca gcggagtgcg 360 
aggttcagat cgcagaggaa tggctcgaga agctcccgcc gcgtacttac gatgaggagt 420 
atgtgctgaa gcgtagctct agatcccgta tcttgaacaa gcactcgcgt cttggttgcc 480 
aagttgtgtt aacgcaggag cttcaaggta tggtcgtcgc tgtccctgaa gctaagcctt 540 
gggatattcc gtaagttttt gttcttgagg atctcataaa tgttgggttc gttctcaatt 600 
tggctaataa aattcgcttt ggaaatttcg gtactctgaa gataactcca tttgaatctg 660 
taattttcgt ttgcttacat ttaaattttg taatgtggta aagaactctg cttgtttaaa 720 
cgattccctg cttcaagtat gggtgaatgt tgttttcttg attcc 
(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



peptide 
1. . 159 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021526 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
Met Ala Thr Thr Leu Gin Lys Leu Ser Ser Gin lie His Arg Leu Ser 
1 5 " 10 15 

Pro Phe Thr Arg Ser Leu lie Val Arg Thr Ser Ala Thr Ser Ala Pro 

20 25 30 

Ser Pro Ser Leu Gly Ser Lys Lys Val Ser Asp Arg lie Val Lys Leu 

35 " ^40 45 

Ser Ala lie Asp Pro Asp Gly Tyr Lys Gin Asp lie lie Gly Leu Ser 

50 ~ 55 " ' 60 

Gly Gin Thr Leu Leu Arg Ala Leu Thr His Thr Gly Leu lie Asp Pro 
65 70 75 80 

Ala Ser His Arg Leu Asp Asp lie Glu Ala Cys Ser Ala Glu Cys Glu 

85 90 95 

Val Gin lie Ala Glu Glu Trp Leu Glu Lys Leu Pro Pro Arg Thr Tyr 

100 105 " 110 

Asp Glu Glu Tyr Val Leu Lys Arg Ser Ser Arg Ser Arg lie Leu Asn 

115 ^ ' 120 125 

Lys His Ser Arg Leu Gly Cys Gin Val Val Leu Thr Gin Glu Leu Gin 
130 135 140 
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Gly Met Val Val Ala Val Pro Glu Ala Lys Pro Trp Asp lie Pro 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 96: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 58 8 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021563 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



atcttcggaa agtctcattt ctcgatcccc aattcgtgga ttagggttaa aagaaccatt 60 

tttattctcg tcgcgcaaca acaaatccag atcgaaaaag gaagaagaga tcgaaatggc 120 

tttgagaagg gtttacagtg aaatcagagg gaagaaggtg acggagcttc caggctatat 180 

caaatcgact ttttcaatgg agaccgtgaa gacctctgtg aagagaggac tcgataacta 240 

caacgaaaaa tacattcaga ccagctccgt tgatcctatc cttcatatct gcttctacgg 300 

catggctttc tcttaccttg tcgctctc.cc taatgagcgt cgccatcttg agcatcagca 360 

gcatgctaag gagcacggtg gtcattgatc tcgtgggaat cgtttcgatc tcgagatgat 420 

tttagggggg ttgctgtgaa atctttctct gcttgatggt gacgacgact caagaattgt 480 

gtcttattgt ttcgttttct tgaattttcc tggataatgt tgacctaaag gaaaaccttt 540 



ctttcgaatt acactccatg atagtcaata attgaagcat catgatgc 
(2) INFORMATION FOR SEQ ID NO: 97: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..128 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021564 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



Ser 


Ser 


Glu 


Ser 


Leu 


He 


Ser 


Arg 


Ser 


Pro 


He 


Arg 


Gly 


Leu 


Gly 


Leu 


1 








5 










10 










15 




Lys 


Glu 


Pro 


Phe 


Leu 


Phe 


Ser 


Ser 


Arg 


Asn 


Asn 


Lys 


Ser 


Arg 


Ser 


Lys 








20 










25 










30 






Lys 


Glu 


Glu 


Glu 


He 


Glu 


Met 


Ala 


Leu 


Arg 


Arg 


Val 


Tyr 


Ser 


Glu 


He 






35 










40 










45 








Arg 


Gly 


Lys 


Lys 


Val 


Thr 


Glu 


Leu 


Pro 


Gly 


Tyr 


He 


Lys 


Ser 


Thr 


Phe 




50 










55 










60 










Ser 


Met 


Glu 


Thr 


Val 


Lys 


Thr 


Ser 


Val 


Lys 


Arg 


Gly 


Leu 


Asp 


Asn 


Tyr 


65 










70 










75 










80 


Asn 


Glu 


Lys 


Tyr 


He 


Gin 


Thr 


Ser 


Ser 


Val 


Asp 


Pro 


He 


Leu 


His 


He 










85 










90 










95 




Cys 


Phe 


Tyr 


Gly 


Met 


Ala 


Phe 


Ser 


Tyr 


Leu 


Val 


Ala 


Leu 


Pro 


Asn 


Glu 








100 










105 










110 






Arg 


Arg 


His 


Leu 


Glu 


His 


Gin 


Gin 


His 


Ala 


Lys 


Glu 


His 


Gly 


Gly 


His 






115 










120 










125 








(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO : 9i 


3 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..90 
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371 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021565 
SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Met 


Ala 


Leu 


Ara 


Ara 


Val 


Tvr 


Ser 


Glu 


He 


Ara 


Gly 


Lvs 


Lys 


Val 


Thr 


1 








5 










10 










15 




Glu 


Leu 


Pro 


Gly 


Tyr 


He 


Lys 


Ser 


Thr 


Phe 


Ser 


Met 


Glu 


Thr 


Val 


Lys 








20 










25 










30 






Thr 


Ser 


Val 


Lys 


Arg 


Gly 


Leu 


Asp 


Asn 


Tyr 


Asn 


Glu 


Lys 


Tyr 


He 


Gin 






35 










40 










45 








Thr 


Ser 


Ser 


Val 


Asp 


Pro 


He 


Leu 


His 


He 


Cys 


Phe 


Tyr 


Gly 


Met 


Ala 




50 










55 










60 










Phe 


Ser 


Tyr 


Leu 


Val 


Ala 


Leu 


Pro 


Asn 


Glu 


Arg 


Arg 


His 


Leu 


Glu 


His 


65 










70 










75 










80 


Gin 


Gin 


His 


Ala 


Lys 


Glu 


His 


Gly 


Gly 


His 















ID 1021576 



85 90 
(2) INFORMATION FOR SEQ ID NO: 99: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..58 6 

(D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
ttgcctaaat cattttaaag actgtataga ggaaaacaaa actgcaaaaa caaaaataaa 
aaaaacatcg cacaagaaaa taaaagattt gtagaatcaa ctaagaaaat ggctagcact 
atgatgacta cattgcctca gttcaatggt cttcgagcca ccaaaatctc tgcagctcct 
gtacaaggcc tggcaagtgt tcagcccatg agacgcaagg gaaatggagc tttgggtgca 
aagtgtgact tcatcggttc atcaacaaat ctgataatgg taacgtcgac gaccctgatg 
ttgttcgcgg ggagattcgg acttgcgcca tcagccaata ggaaggcgac agctggactt 
aggttggagg cacgtgactc aggtctacaa acgggtgacc cggccgggtt cacgcttgcg 
gacactttgg cttgtggcac cgttggtcat atcatcggtg taggagttgt ccttggcctt 
aaaaacattg gtgctatttg aagttcctaa agctctttta tttgtatttg taaaatttgt 
agatttttat aacaatattc tcatgcacct gaacgagatc taatgc 
(2) INFORMATION FOR SEQ ID NO: 100: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. . 130 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021577 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
Met Ala Ser Thr Met Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg 
15 10 15 

Ala Thr Lys He Ser Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin 

20 25 30 

Pro Met Arg Arg Lys Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe 

35 40 45 

He Gly Ser Ser Thr Asn Leu lie Met Val Thr Ser Thr Thr Leu Met 

50 55 60 

Leu Phe Ala Gly Arg Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala 
65 70 "* 75 80 

Thr Ala Gly Leu Arg Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly 

85 90 ' 95 

Asp Pro Ala Gly Phe Thr Leu Ala Asp Thr Leu Ala Cys Gly Thr Val 
100 105 110 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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Gly His lie lie Gly Val Gly Val Val Leu Gly Leu Lys Asn He Gly 
115 120 125 

Ala He 
130 

5 (2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..126 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 1021578 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
Met Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg Ala Thr Lys He 
15 10 15 

Ser Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin Pro Met Arg Arg 
20 20 25 30 

Lys Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe He Gly Ser Ser 

35 40 45 

Thr Asn Leu He Met Val Thr Ser Thr Thr Leu Met Leu Phe Ala Gly 
50 55 60 

25 Arg Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala Thr Ala Gly Leu 
65 70 75 80 

Arg Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly Asp Pro Ala Gly 

85 90 95 

Phe Thr Leu Ala Asp Thr Leu Ala Cys Gly Thr Val Gly His He He 
30 100 105 110 

Gly Val Gly Val Val Leu Gly Leu Lys Asn He Gly Ala He 

115 120 125 

(2) INFORMATION FOR SEQ ID NO: 102: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 125 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021579 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
4 5 Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg Ala Thr Lys He Ser 
15 10 15 

Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin Pro Met Arg Arg Lys 

20 25 30 

Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe He Gly Ser Ser Thr 
50 35 4 0 * ' 45 

Asn Leu He Met Val Thr Ser Thr Thr Leu Met Leu Phe Ala Gly Arg 

50 55 60 

Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala Thr Ala Gly Leu Arg 
65 ' 7 0 75 80 

55 Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly Asp Pro Ala Gly Phe 

85 90 95 

Thr Leu Ala Asp Thr Leu Ala Cys Gly Thr Val Gly His He He Gly 

100 105 110 

Val Gly Val Val Leu Gly Leu Lys Asn He Gly Ala He 
60 115 120 125 

(2) INFORMATION FOR SEQ ID NO: 103: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..801 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021927 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

atttttctct cagaatccat aaaaagagag agagataaat aaagagaaaa 
tagaagatgg agaaaagtaa tgatcatgac aaggctagcc acggcggctc 
gccacggaga agtgggagga gactagcctc ggaatccgaa ctgccgagac 
ttagctccgg tgggtctttg tgttgcagcg cttgttgtca tgcttaagga 

15 aatgagttcg gttcaatttc ttactccaat ctcacagcct ttaggtactt 
aatggaatat gtgcaggcta ctctcttcta tcagcagcca ttgcagcgat 
tcttcgacaa tgcctcgtgt ttggaccttc ttttgtctcg accagcttct 
gttcttgctg ctggagctgt atcagctgag gttctatact tggcttacaa 
gccattactt ggagcgatgc atgtagctcc tacggcggtt tctgtcatag 

20 tctgttataa tcacattctt tgtggtttgt ttctacatcg ttctctctct 
tataagctct ttactcgctt tgatcctcct tccattgttg actccgccaa 
gtcgctgtct tcggaagtta gatcctccat ttgctcttca agtacatcta 
tttcaaatgt tgtgtgtttt taactttggt caagagaaag aatgcttatg 
tgtttcattg cttttcttcc t 

25 (2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..204 
35 (D) OTHER INFORMATION: / Ceres Seq. ID 1021928 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



Met 


Glu 


Lys 


Ser 


Asn 


Asp 


His 


Asp 


Lys 


Ala 


Ser 


His 


Gly 


Gly 


Ser 


Gly 


1 








5 










10 










15 




Gly 


Gly 


Ala 


Thr 


Glu 


Lys 


Trp 


Glu 


Glu 


Thr 


Ser 


Leu 


Gly 


He 


Arg 


Thr 








20 










25 










30 






Ala 


Glu 


Thr 


Met 


Leu 


Arg 


Leu 


Ala 


Pro 


Val 


Gly 


Leu 


Cys 


Val 


Ala 


Ala 






35 










40 










45 








Leu 


Val 


Val 


Met 


Leu 


Lys 


Asp 


Ser 


Glu 


Thr 


Asn 


Glu 


Phe 


Gly 


Ser 


He 




50 










55 










60 










Ser 


Tyr 


Ser 


Asn 


Leu 


Thr 


Ala 


Phe 


Arg 


Tyr 


Leu 


Val 


His 


Ala 


Asn 


Gly 


65 










70 










75 










80 


He 


Cys 


Ala 


Gly 


Tyr 


Ser 


Leu 


Leu 


Ser 


Ala 


Ala 


He 


Ala 


Ala 


Met 


Pro 










85 










90 










95 




Arg 


Ser 


Ser 


Ser 


Thr 


Met 


Pro 


Arg 


Val 


Trp 


Thr 


Phe 


Phe 


Cys 


Leu 


Asp 








100 










105 










110 






Gin 


Leu 


Leu 


Thr 


Tyr 


Leu 


Val 


Leu 


Ala 


Ala 


Gly 


Ala 


Val 


Ser 


Ala 


Glu 






115 










120 










125 








Val 


Leu 


Tyr 


Leu 


Ala 


Tyr 


Asn 


Gly 


Asp 


Ser 


Ala 


He 


Thr 


Trp 


Ser 


Asp 




130 










135 










140 










Ala 


Cys 


Ser 


Ser 


Tyr 


Gly 


Gly 


Phe 


Cys 


His 


Arg 


Ala 


Thr 


Ala 


Ser 


Val 


145 










150 










155 










160 


He 


He 


Thr 


Phe 


Phe 


Val 


Val 


Cys 


Phe 


Tyr 


He 


Val 


Leu 


Ser 


Leu 


He 










165 










170 










175 




Ser 


Ser 


Tyr 


Lys 


Leu 


Phe 


Thr 


Arg 


Phe 


Asp 


Pro 


Pro 


Ser 


He 


Val 


Asp 








180 










185 










190 






Ser 


Ala 


Lys 


Asn 


Leu 


Glu 


Val 


Ala 


Val 


Phe 


Gly 


Ser 














195 










200 



















ctgaagaagc 60 

cggtggtggt 120 

aatgctccgg 180 

ctctgagact 240 

ggtgcacgca 300 

gcctcgttct 360 

gacctacctg 420 

tggagactca 480 

agccactgct 540 

aatctcctct 600 

gaatctcgaa 660 

gtttgtcatg 720 

tatttcctct 780 
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(2) INFORMATION FOR SEQ ID NO: 105: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
10 (B) LOCATION: 1..169 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021929 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
Met Leu Arg Leu Ala Pro Val Gly Leu Cys Val Ala Ala Leu Val Val 
1 5 10 15 

15 Met Leu Lys Asp Ser Glu Thr Asn Glu Phe Gly Ser lie Ser Tyr Ser 

20 25 30 

Asn Leu Thr Ala Phe Arg Tyr Leu Val His Ala Asn Gly lie Cys Ala 

35 40 45 

Gly Tyr Ser Leu Leu Ser Ala Ala lie Ala Ala Met Pro Arg Ser Ser 
20 50 55 60 

Ser Thr Met Pro Arg Val Trp Thr Phe Phe Cys Leu Asp Gin Leu Leu 
65 70 75 80 

Thr Tyr Leu Val Leu Ala Ala Gly Ala Val Ser Ala Glu Val Leu Tyr 
85 90 95 

25 Leu Ala Tyr Asn Gly Asp Ser Ala lie Thr Trp Ser Asp Ala Cys Ser 

100 " 105 110 

Ser Tyr Gly Gly Phe Cys His Arg Ala Thr Ala Ser Val lie lie Thr 

115 120 125 

Phe Phe Val Val Cys Phe Tyr lie Val Leu Ser Leu lie Ser Ser Tyr 
30 130 " 135 140 

Lys Leu Phe Thr Arg Phe Asp Pro Pro Ser lie Val Asp Ser Ala Lys 
145 150 155 160 

Asn Leu Glu Val Ala Val Phe Gly Ser 
163 

35 (2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..153 

45 (D) OTHER INFORMATION: / Ceres Seq. ID 1021930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
Met Leu Lys Asp Ser Glu Thr Asn Glu Phe Gly Ser lie Ser Tyr Ser 
15 10 15 

Asn Leu Thr Ala Phe Arg Tyr Leu Val His Ala Asn Gly lie Cys Ala 
50 20 25 30 

Gly Tyr Ser Leu Leu Ser Ala Ala lie Ala Ala Met Pro Arg Ser Ser 

35 40 45 

Ser Thr Met Pro Arg Val Trp Thr Phe Phe Cys Leu Asp Gin Leu Leu 
50 55 60 

55 Thr Tyr Leu Val Leu Ala Ala Gly Ala Val Ser Ala Glu Val Leu Tyr 
65 70 75 80 

Leu Ala Tyr Asn Gly Asp Ser Ala lie Thr Trp Ser Asp Ala Cys Ser 

8 5 90 ' 95 

Ser Tyr Gly Gly Phe Cys His Arg Ala Thr Ala Ser Val lie lie Thr 
60 ' 100 " 105 110 

Phe Phe Val Val Cys Phe Tyr lie Val Leu Ser Leu lie Ser Ser Tyr 
115 ~ 120 125 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



375 

Lys Leu Phe Thr Arg Phe Asp Pro Pro Ser lie Val Asp Ser Ala Lys 

130 135 140 

Asn Leu Glu Val Ala Val Phe Gly Ser 



145 

(2) 



150 



(ii) 
(ix) 



(xi) 



ID 1021945 



INFORMATION FOR SEQ ID NO: 107: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..602 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
atcgagcgat ggatttggca gaactgtggg cgattttcgg acccggattc tccggcgccg 
ttttcggaac cgggtggtgg ggtcgacgcc gtcgtttgca gttccatcca agttcccttc 
gtccattacc ttcccggcat attcgcttct ctcggagctt tgatgttcaa ttgcgtcaga 
aaagaagaca ttgattactc tccttacgac gaaggcgagt ggagattgaa gctgtggctt 
ttcatagcgt atgtcgtagc atttgtttcc ttagctgctt ctgttggctt gctgattcaa 
gattcggttg tgaagactgg gccttcaact tggactggtg tggctggtgt ctttcaatgt 
gtatttgtat tgataagtgg gctaatgtat tggacatcgc actcagagta gcgcacaggc 
acatacttca tctccgactc agattcatac tgtgacaact tacaagttcc tctcgtttct 
ctactatgta aattcctgtg tcttgtcttg tcaccttggc attgtgtgcg agattcctct 
gttgcatcag tgtgtgtttc tgttaaaact ttgtgtgaat gttattgaaa atactccaaa 
cc 

INFORMATION FOR SEQ ID NO: 108: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(2) 



(ii) MOLZCULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. .136 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021946 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
lie Glu Arg Trp He Trp Gin Asn Cys Gly Arg Phe Ser Asp Pro Asp 
1 5 10 15 

Ser Pro Ala Pro Phe Ser Glu Pro Gly Gly Gly Val Asp Ala Val Val 

20 25 30 

Cys Ser Ser He Gin Val Pro Phe Val His Tyr Leu Pro Gly He Phe 

35 40 45 

Ala Ser Leu Gly Ala Leu Met Phe Asn Cys Val Arg Lys Glu Asp He 

50 " 55 60 

Asp Tyr Ser Pro Tyr Asp Glu Gly Glu Trp Arg Leu Lys Leu Trp Leu 
65 70 75 80 

Phe He Ala Tyr Val Val Ala Phe Val Ser Leu Ala Ala Ser Val Gly 

85 90 95 

Leu Leu He Gin Asp Ser Val Val Lys Thr Gly Pro Ser Thr Trp Thr 

100 105 110 

Gly Val Ala Gly Val Phe Gin Cys Val Phe Val Leu He Ser Gly Leu 

115 120 125 

Met Tyr Trp Thr Ser His Ser Glu 

130 135 
(2) INFORMATION FOR SEQ ID NO: 109: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
5 (B) LOCATION: 1..82 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021947 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 





Met 


Phe 


Asn 


Cys 


Val 


Arg 


Lys 


Glu 


Asp 


He 


Asp 


Tyr 


Ser 


Pro 


Tyr 


Asp 




1 








5 










10 










15 




10 


Glu 


Gly 


Glu 


Trp 
20 


Arg 


Leu 


Lys 


Leu 


Trp 
25 


Leu 


Phe 


He 


Ala 


Tyr 
30 


Val 


Val 




Ala 


Phe 


Val 

35 


Ser 


Leu 


Ala 


Ala 


Ser 
40 


Val 


Gly 


Leu 


.Leu 


He 
45 


Gin 


Asp 


Ser 




Val 


Val 


Lys 


Thr 


Gly 


Pro 


Ser 


Thr 


Trp 


Thr 


Gly 


Val 


Ala 


Gly 


Val 


Phe 


15 




50 _ 










55 










60 












Gin 


Cys 


Val 


Phe 


Val 


Leu 


He 


Ser 


Gly 


Leu 


Met 


Tyr 


Trp 


Thr 


Ser 


His 




65 










70 










75 










80 



Ser Glu 

20 (2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..693 

30 (D) OTHER INFORMATION: / Ceres Seq. ID 1022170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
gagtcgctct tctttccttc cgccattttt ttctttctag g^.tgcagagt taaaggagaa 
ggttttcaat tagggttttg tagagagaaa gatgagccga ag ttgggaa taccggtgaa 
gcttcttcac gaggcctcag gtcatatcgt gacggtggag ctaa^gagcg gcgagcttta 

35 cagaggaagt atgattgagt gtgaggataa ctggaactgt cagctcgagg atattactta 
taccgccaag gatggtaagg tatcacagct tgagcatgtc ttcattcgag gcagtaaagt 
caggtttatg gtcataccag acattctcaa acatgctcca atgttcaagc ggttagatgc 
tagaatcaag ggaaagagct catcactggg tgttggcaga ggtagagctg caatgcgagg 
gaaaccggct gccgggcccg ggcgtggaac tggaggaagg ggagcggtac cacctgtgag 

4 0 gagatgattg attttacgct caacatgtaa gctttgattc tgggatagtc tttctgtgat 
atccaggtga aagccgcctt tcttggttct gtcattaatg ctttagagaa cactgtaaaa 
ccaaacaaga gagagatata tccagatgca ctagtagttc tgtattttgg gttaccaaga 
tgtttgaatc attgacaagt tttcgttgtg ate 
(2) INFORMATION FOR SEQ ID NO: 111: 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
50 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..131 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022171 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Met Ser Arg Ser Leu Gly He Pro Val Lys Leu Leu His Glu Ala Ser 
15 10 15 

Gly His He Val Thr Val Glu Leu Lys Ser Gly Glu Leu Tyr Arg Gly 
20 25 30 

60 Ser Met He Glu Cys Glu Asp Asn Trp Asn Cys Gin Leu Glu Asp He 
35 4 0 4 5 

Thr Tyr Thr Ala Lys Asp Gly Lys Val Ser Gin Leu Glu His Val Phe 
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50 55 60 

lie Arg Gly Ser Lys Val Arg Phe Met Val lie Pro Asp lie Leu Lys 
65 70 75 80 

His Ala Pro Met Phe Lys Arg Leu Asp Ala Arg lie Lys Gly Lys Ser 
5 85 90 95 

Ser Ser Leu Gly Val Gly Arg Gly Arg Ala Ala Met Arg Gly Lys Pro 

100 105 110 

Ala Ala Gly Pro Gly Arg Gly Thr Gly Gly Arg Gly Ala Val Pro Pro 
115 ~ " 120 ~ ~ * 125 

10 Val Arg Arg 
130 

(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

20 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 98 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022172 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
Met lie Glu Cys Glu Asp Asn Trp Asn Cys Gin Leu Glu Asp lie Thr 
25 1 "5 10 ' 15 

Tyr Thr Ala Lys Asp Gly Lys Val Ser Gin Leu Glu His Val Phe lie 

20 25 30 

Arg Gly Ser Lys Val Arg Phe Met Val lie Pro Asp lie Leu Lys His 
35 4 0 4 5 

3 0 Ala Pro Met Phe Lys Arg Leu Asp Ala Arg lie Lys Gly Lys Ser Ser 

50 55 60 

Ser Leu Gly Val Gly Arg Gly Arg Ala Ala Met Arg Gly Lys Pro Ala 
65 " ' 70 ~ 75 80 

Ala Gly Pro Gly Arg Gly Thr Gly Gly Arg Gly Ala Val Pro Pro Val 
35 85 90 95 

Arg Arg 

(2) INFORMATION FOR SEQ ID NO: 113: 
(i) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022173 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
50 Glu Ser Leu Phe Phe Pro Ser Ala lie Phe Phe Phe Leu Gly Cys Arg 
15 10 15 

Val Lys Gly Glu Gly Phe Gin Leu Gly Phe Cys Arg Glu Lys Asp Glu 

20 25 " 30 

Pro Lys Phe Gly Asn Thr Gly Glu Ala Ser Ser Arg Gly Leu Arg Ser 
55 35 40 45 

Tyr Arg Asp Gly Gly Ala Lys Glu Arg Arg Ala Leu Gin Arg Lys Tyr 
50 55 60 

Asp 
65 

60 (2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..706 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022554 
(xi) SEQUFNCE DESCRIPTION: SEQ ID NO: 114: 
agaccaaaaa agcggttggt gtaagatccc aaactcacag attcccaaat aatagtaata 



ctcttcctct tctcaactct caccagtcac cagcagatca tcggagatgg gcggagacgg 12 

aaaagttttc accttgtccg aggttcctca gcacagtagc gccaaggatt gttggatcgt 18 

catcgacggc aaggtttatg atgtgacaaa gttcttggat gatcatcctg gtggtgatga 24 

ggttatcttg acttctacag ggaaagatgc gaccgatgat ttcgaggatg tgggacatag 30 

ttcgactgcg aaagccatgc tagatgagta ctatgtgggt gatattgaca cagctactgt 36 

gccggttaaa gctaagtttg tgcctcctac gtcgacgaaa gccgtggcta ctcaggataa 42 

gagctcggat tttgttatta agctccttca gttccttgtt ccacttctaa tcttaggctt 48 

ggctttcggc attcggtact acactaagac caaggctcct tcttcttgaa agattgaatt 54 

gtggcctaag tgaattgctt gtgtacgacg tggttaaact ctgtctctac ctctgtttgt 60 

cttgtaagac atcattatat tctctattgc aatcaactga tagacctttg atttttaagc 66 



catctacctt tgttgtctta tatcaatcaa atccatttac aatatc 
(2) INFORMATION FOR SEQ ID NO: 115: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..14 0 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022555 
(xi) SEQUENCE DESCRIPTION . SEQ ID NO: 115: 



Met 


Gly 


Gly 


Asp 


Gly 


Lys 


Val 


Phe 


Thr 


Leu 


Ser 


Glu 


Val 


Ser 


Gin 


His 


1 








5 










10 










15 




Ser 


Ser 


Ala 


Lys 


Asp 


Cys 


Trp 


He 


Val 


He 


Asp 


Gly 


Lys 


Val 


Tyr 


Asp 








20 










25 










30 






Val 


Thr 


Lys 


Phe 


Leu 


Asp 


Asp 


His 


Pro 


Gly 


Gly 


Asp 


Glu 


Val 


He 


Leu 






35 










40 










45 








Thr 


Ser 


Thr 


Gly 


Lys 


Asp 


Ala 


Thr 


Asp 


Asp 


Phe 


Glu 


Asp 


Val 


Gly 


His 




50 










55 










60 










Ser 


Ser 


Thr 


Ala 


Lys 


Ala 


Met 


Leu 


Asp 


Glu 


Tyr 


Tyr 


Val 


Gly 


Asp 


He 


65 










70 










75 










80 


Asp 


Thr 


Ala 


Thr 


Val 


Pro 


Val 


Lys 


Ala 


Lys 


Phe 


Val 


Pro 


Pro 


Thr 


Ser 










85 










90 










95 




Thr 


Lys 


Ala 


Val 


Ala 


Thr 


Gin 


Asp 


Lys 


Ser 


Ser 


Asp 


Phe 


Val 


He 


Lys 








100 










105 










110 






Leu 


Leu 


Gin 


Phe 


Leu 


Val 


Pro 


Leu 


Leu 


He 


Leu 


Gly 


Leu 


Ala 


Phe 


Gly 






115 










120 










125 








He 


Arg 


Tyr 


Tyr 


Thr 


Lys 


Thr 


Lys 


Ala 


Pro 


Ser 


Ser 












130 










135 










140 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO:116: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 70 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022556 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
Met Leu Asp Glu Tyr Tyr Val Gly Asp lie Asp Thr Ala Thr Val Pro 
1 5 10 ~ 15 

Val Lys Ala Lys Phe Val Pro Pro Thr Ser Thr Lys Ala Val Ala Thr 

20 25 30 

Gin Asp Lys Ser Ser Asp Phe Val lie Lys Leu Leu Gin Phe Leu Val 

35 40 ^ 45 

Pro Leu Leu lie Leu Gly Leu Ala Phe Gly lie Arg Tyr Tyr Thr Lys 

50 55 ~ 60 

Thr Lys Ala Pro Ser Ser 
65 ~ 7Q 
(2) INFORMATION FOR SEQ ID NO: 117: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..722 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022594 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
aacgtcatca ttctgacttt tctcaaagaa gagaaatcaa atcaccaagt catcggagcg 
atcacagaga agaagaaaac catggcggaa gaagagaaaa gcgtgatgaa qcaagtgatg 
gtggcgattg atgaaagcga atgtagtaaa cgcgctcttc aatggacgct cgtgtatctc 
aaagatagcc tcgccgattc cgacatcatc ctcttcactg ctcaacctca tcttgatctc 
agctgcgtct atgcttcctc ttatggcgcc gctccgattg agcttataaa ctcattgcaa 
gagagtcata agaacgctgg attgaatcgt cttgatgaag gaaccaaaat ttgtgctgag 
actggggtta ctccaagaaa ggtgttggaa tttggaaatc ctaaagaagc gatatgtgaa 
gctgctgaga agcttggtgt tgatatgctt gtggttggaa gccatggtaa aggagcatta 
caaaggactt tccttggaag tgttagcaat tactgtgtta acaatgctaa gtgcccagtt 
cttgtggtga gaacaaaagc ttgaagacct cttctcatga aagagttcga caaccctggc 
ttgatgtgtg tgtgttgtaa acatattgat aataatgctc tgcttggtta ctattgtttg 
aagaaacttt gtgagtaaac ataatttatt ttgtttgata atgaaagagc attgccgtgt 
tc 

(2) INFORMATION FOR SEQ ID NO: 118: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..187 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022595 
SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
Asn Val lie lie Leu Thr Phe Leu Lys Glu Glu Lys Ser Asn His Gin 
1 5 10 15 

Val He Gly Ala He Thr Glu Lys Lys Lys Thr Met Ala Glu Glu Glu 

20 25 " 30 

Lys Ser Val Met Lys Gin Val Met Val Ala He Asp Glu Ser Glu Cys 

35 40 45 

Ser Lys Arg Ala Leu Gin Trp Thr Leu Val Tyr Leu Lys Asp Ser Leu 

50 55 " 60 

Ala Asp Ser Asp He He Leu Phe Thr Ala Gin Pro His Leu Asp Leu 
65 70 7 5 8 0 

Ser Cys Val Tyr Ala Ser Ser Tyr Gly Ala Ala Pro He Glu Leu He 

8 5 " 90 95 

Asn Ser Leu Gin Glu Ser His Lys Asn Ala Gly Leu Asn Arg Leu Asp 
100 " 105 110 



(ii) 
(ix) 



(xi) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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Glu Gly Thr Lys lie Cys Ala Glu Thr Gly Val Thr Pro Arg Lys Val 

115 " ~ 120 125 

Leu Glu Phe Gly Asn Pro Lys Glu Ala lie Cys Glu Ala Ala Glu Lys 
130 135 140 

5 Leu Gly Val Asp Met Leu Val Val Gly Ser His Gly Lys Gly Ala Leu 
145 150 155 160 

Gin Arg Thr Phe Leu Gly Ser Val Ser Asn Tyr Cys Val Asn Asn Ala 

165 170 175 

Lys Cys Pro Val Leu Val Val Arg Thr Lys Ala 
10 180 185 

(2) INFORMATION FOR SEQ ID NO: 119: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
2 0 (B) LOCATION: 1..160 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022596 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
Met Ala Glu Glu Glu Lys Ser Val Met Lys Gin Val Met Val Ala He 
15 10 15 

2 5 Asp Glu Ser Glu Cys Ser Lys Arg Ala Leu Gin Trp Thr Leu Val Tyr 

20 25 30 

Leu Lys Asp Ser Leu Ala Asp Ser Asp He He Leu Phe Thr Ala Gin 

35 4 0 ~ 45 

Pro His Leu Asp Leu Ser Cys Val Tyr Ala Ser Ser Tyr Gly Ala Ala 
30 50 55 60 

Pro He Glu Leu He Asn Ser Leu Gin Glu Ser His Lys Asn Ala Gly 
65 70 75 80 

Leu Asn Arg Leu Asp Glu Gly Thr Lys He Cys Ala Glu Thr Gly Val 
85 " 90 95 

35 Thr Pro Arg Lys Val Leu Glu Phe Gly Asn Pro Lys Glu Ala He Cys 

100 105 110 

Glu Ala Ala Glu Lys Leu Gly Val Asp Met Leu Val Val Gly Ser His 

115 120 125 

Gly Lys Gly Ala Leu Gin Arg Thr Phe Leu Gly Ser Val Ser Asn Tyr 
40 ~ 130 ~ 135 140 

Cys Val Asn Asn Ala Lys Cys Pro Val Leu Val Val Arg Thr Lys Ala 
145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 120: 
(i) SEQUENCE CHARACTERISTICS: 
4 5 (A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

50 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 152 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022597 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
55 Met Lys Gin Val Met Val Ala He Asp Glu Ser Glu Cys Ser Lys Arg 
15 10 15 

Ala Leu Gin Trp Thr Leu Val Tyr Leu Lys Asp Ser Leu Ala Asp Ser 

20 25 30 

Asp He He Leu Phe Thr Ala Gin Pro His Leu Asp Leu Ser Cys Val 
60 35 40 45 

Tyr Ala Ser Ser Tyr Gly Ala Ala Pro He Glu Leu He Asn Ser Leu 
50 55 60 
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Gin Glu Ser His Lys Asn Ala Gly Leu Asn Arg Leu Asp Glu Gly Thr 
65 70 75 80 

Lys lie Cys Ala Glu Thr Gly Val Thr Pro Arg Lys Val Leu Glu Phe 

85 90 95 

Gly Asn Pro Lys Glu Ala lie Cys Glu Ala Ala Glu Lys Leu Gly Val 

100 105 110 

Asp Met Leu Val Val Gly Ser His Gly Lys Gly Ala Leu Gin Arg Thr 

115 " 120 ' ~ " 125 

Phe Leu Gly Ser Val Ser Asn Tyr Cys Val Asn Asn Ala Lys Cys Pro 

130 ' 135 140 

Val Leu Val Val Arg Thr Lys Ala 

150 

INFORMATION FOR SEQ ID NO: 121: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..610 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022621 
SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
aattttgtgt ggtaattaac tggtgagaga gttcaacatc caatctttct ctctctctct 
atcttttatc tctcacactc ttaaaatgct tcttctatct ccgatctctg cctctcttcc 
accgtcgttt caccggggta atttgatccg gcgatcaatt aagccgttag gtagggttgt 
tgcaaaggcc aaggacaaca ccgacactgg tggatttctg gagacggcag ctatagccgg 
tggtttagtg tcgacgccgg ttattggatg gtcactctac actttgaaga ccacgggatg 
tggcttacct cccggaccgg ccggtttaat cggagcgttg gaaggtgtga gctacttggt 
ggtggttggc atcgtgggct ggtctttgta cactaaaaca aaaactgggt caggtctgcc 
aaatgggcca tttggtttgt tgggtgcggt cgagggttta tcgtatctgt cggttctagc 
cattcttgtg gtgtttggta ttcagttctt ggataatggg tcggttccag gtccacttcc 
tagtgaccag tgttttggtt aaccacgtgt aatgtcgtat aaataaagtg tcaaattaat 
aattgctttc 

(2) INFORMATION FOR SEQ ID NO: 122: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..158 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022622 
SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
Met Leu Leu Leu Ser Pro lie Ser Ala Ser Leu Pro Pro Ser Phe His 
15 10 15 

Arg Gly Asn Leu lie Arg Arg Ser lie Lys Pro Leu Gly Arg Val Val 

20 25 30 

Ala Lys Ala Lys Asp Asn Thr Asp Thr Gly Gly Phe Leu Glu Thr Ala 

35 40 ' 45 

Ala lie Ala Gly Gly Leu Val Ser Thr Pro Val lie Gly Trp Ser Leu 

50 55 60 

Tyr Thr Leu Lys Thr Thr Gly Cys Gly Leu Pro Pro Gly Pro Ala Gly 
65 70 ^ 75 ^ 80 

Leu lie Gly Ala Leu Glu Gly Val Ser Tyr Leu Val Val Val Gly lie 

85 90 95 

Val Gly Trp Ser Leu Tyr Thr Lys Thr Lys Thr Gly Ser Gly Leu Pro 

100 105 ' 110 

Asn Gly Pro Phe Gly Leu Leu Gly Ala Val Glu Gly Leu Ser Tyr Leu 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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115 120 125 

Ser Val Leu Ala lie Leu Val Val Phe Gly lie Gin Phe Leu Asp Asn 

130 135 ~ 140 

Gly Ser Val Pro Gly Pro Leu Pro Ser Asp Gin Cys Phe Gly 
14 5 ' 150 155 

(2) INFORMATION FOR SEQ ID NO: 123: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .872 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024375 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



atcgaacatt aaaacctgat tttactgaaa ctgtgctgcg aaggagaagc cgcaaaccat 60 

tttttgtcaa tctagggttc ttcttcgtac ttgcagattt cgcctccgtt ttcactacga 120 

tctgtgaaga tgttctccgc tcagaacaag atcaataagg acaagaatgc tgagccaaca 180 

gaatgcyagg agcaagttgc tcaggctttg tttgatttgg agaacactaa ccaggagttg 240 

aaaagcgagt tgaaagatct ctacatcaac caagctgttc acatggatat ctctggaaac 300 

cgcaaagctg ttgtgattta cgttccattc agattgagga aagctttccg caagattcat 360 

ccccgtctcg tcagagagct tgagaagaag ttcagtggaa aggatgttat ctttgttacc 420 

acaagaagga tcatgcgtcc ccccaagaag ggtgctgctg ttcagaggcc acgtaacaga 480 

actcttacct cagttcatga agctatgctt gaagatgttg ctttccccgc tgagattgtt 540 

ggaaagcgta ctcgctaccg tcttgatggt tccaagatca tgaaggtctt tttggatgcc 600 

aaggaaaaga acaacacaga gtacaagctc gagactatgg tcggtgtgta ccgtaaactt 660 

actggcaaag atgttgtttt tgagtaccca gtcgaagctt gaaagaagat gatgaagaac 720 

catcaggata gtgaaagaga gcttttgttt atgttttgtg gtatttagga tgaaggaaac 780 

tctcttgatt cagttccttg ttcacaatct ttaatgttct atttacaatg actacttttt 840 



gtgtttttca attttgaaac ctctatgatt tc 
(2) INFORMATION FOR SEQ ID NO: 124: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..190 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024376 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



Met 


Phe 


Ser 


Ala 


Gin 


Asn 


Lys 


He 


Asn 


Lys 


Asp 


Lys 


Asn 


Ala 


Glu 


Pro 


1 








5 










10 










15 




Thr 


Glu 


Cys 


Glu 


Glu 


Gin 


Val 


Ala 


Gin 


Ala 


Leu 


Phe 


Asp 


Leu 


Glu 


Asn 








20 










25 










30 






Thr 


Asn 


Gin 


Glu 


Leu 


Lys 


Ser 


Glu 


Leu 


Lys 


Asp 


Leu 


Tyr 


He 


Asn 


Gin 






35 










40 










45 








Ala 


Val 


His 


Met 


Asp 


He 


Ser 


Gly 


Asn 


Arg 


Lys 


Ala 


Val 


Val 


He 


Tyr 




50 










55 










60 










Val 


Pro 


Phe 


Arg 


Leu 


Arg 


Lys 


Ala 


Phe 


Arg 


Lys 


He 


His 


Pro 


Arg 


Leu 


65 










70 










75 










80 


Val 


Arg 


Glu 


Leu 


Glu 


Lys 


Lys 


Phe 


Ser 


Gly 


Lys 


Asp 


Val 


He 


Phe 


Val 










85 










90 










95 




Thr 


Thr 


Arg 


Arg 


He 


Met 


Arg 


Pro 


Pro 


Lys 


Lys 


Gly 


Ala 


Ala 


Val 


Gin 








100 










105 










110 






Arg 


Pro 


Arg 


Asn 


Arg 


Thr 


Leu 


Thr 


Ser 


Val 


His 


Glu 


Ala 


Met 


Leu 


Glu 






115 










120 










125 








Asp 


Val 


Ala 


Phe 


Pro 


Ala 


Glu 


He 


Val 


Gly 


Lys 


Arg 


Thr 


Arg 


Tyr 


Arg 




130 










135 










140 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 98 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024536 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



Met 


Ser 


Gly 


Glu 


Glu 


Glu 


Ala 


Thr 


Val 


Arg 


Glu 


Pro 


Leu 


Asp 


Leu 


He 


1 








5 










10 










15 




Arg 


Leu 


Ser 


Leu 


Asp 


Glu 


Arg 


He 


Tyr 


Val 


Lys 


Leu 


Arg 


Ser 


Asp 


Arg 








20 










25 










30 






Glu 


Leu 


Arg 


Gly 


Lys 


Leu 


His 


Ala 


Phe 


Asp 


Gin 


His 


Leu 


Asn 


Met 


He 






35 










40 










45 








Leu 


Gly 


Asp 


Val 


Glu 


Glu 


Thr 


He 


Thr 


Thr 


Val 


Glu 


He 


Asp 


Asp 


Glu 




50 










55 










60 










Thr 


Tyr 


Glu 


Glu 


He 


Val 


Arg 


Thr 


Thr 


Lys 


Arg 


Thr 


He 


Glu 


Phe 


Leu 


65 










70 










75 










80 


Phe 


Val 


Arg 


Gly 


Asp 


Gly 


Val 


He 


Leu 


Val 


Ser 


Pro 


Pro 


Leu 


Arg 


Thr 



2 0 8 5 90 95 

Ala Ala 

(2) INFORMATION FOR SEQ ID NO: 128: 
(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

3 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..53 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024537 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
35 Lys Leu Leu Ser Ser Cys Pro Arg Asp Gin Ser Leu Cys Pro Val He 
1 5 10 15 

Arg Phe Val Gly Thr Lys Thr Leu Arg Glu Val Phe Gin Val Gly Arg 

20 25 30 

Tyr Leu Cys Leu His Asp Val Arg Arg Gly Arg Ser His Arg Glu Gly 
40 35 40 45 

Ala Thr Arg Ser Asp 
50 

(2) INFORMATION FOR SEQ ID NO: 129: 
(i) SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

50 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 52 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024538 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
55 Met He Leu Gly Asp Val Glu Glu Thr He Thr Thr Val Glu He Asp 
15 10 15 

Asp Glu Thr Tyr Glu Glu He Val Arg Thr Thr Lys Arg Thr He Glu 

20 25 30 

Phe Leu Phe Val Arg Gly Asp Gly Val He Leu Val Ser Pro Pro Leu 
60 35 40 45 

Arg Thr Ala Ala 
50 
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(2) INFORMATION FOR SEQ ID NO: 130: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
10 (B) LOCATION: 1..797 

(D) OTHER INFORMATION: / Ceres Seq. ID 1025683 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
aacccttaaa ctccgttacc ctaaacctca agccgtcgtc tccaaagttc cccctcagag 60 
acgaaaaaat gaagaccatt ttgtcctcgg agactatgga catccccgac ggcgtcgcca 120 
15 ttaaggtaaa cgccaaggtg attgaggtcg aaggtccacg aggtaaactc actcgtgact 180 
tcaagcatct gaatctcgat ttccagttga ttaaagacca agtcactgga aaacgtcagc 240 
ttaagattga ttcttggttt ggttctcgta agacaagtgc ttcgattaga actgctttaa 300 
gccatgttga taatctcatt gctggtgtta ctcaaggttt tctttataga atgagatttg 360 
tgtatgctca ttttcctatc aatgcttcta ttgatggtaa caataagagt attgagattc 420 
20 gtaacttcct tggtgagaag aaggtgagga aggttgagat gttggatggt gttaagattg 480 
ttcgatctga gaaggttaag gatgagatta ttcttgaggg aaatgatatt gagcttgttt 540 
cacggtcttg tgctttgatc aatcagaaat gtcatgtgaa gaagaaggat atcaggaagt 600 
ttcttgatgg tatctatgtg agcgagaaag gcaagatcgc agtcgaggaa tgaagttcgc 660 
tattgaaagt tctatattag attttatatg gtgaagcagt gatattatcg tagctttttg 720 
25 ttttacatct ttcttatgga tttgtagttt tgtttttcct tttgaatcat tgatgaatgc 780 
aaattcatta tccgttc 

(2) INFORMATION FOR SEQ ID NO: 131: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

35 (A) NAME /KEY : peptide 

(B) LOCATION: 1..216 

(D) OTHER INFORMATION: / Ceres Seq. ID 1025684 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 





Pro 


Leu 


Asn 


Ser 


Val 


Thr 


Leu 


Asn 


Leu 


Lys 


Pro 


Ser 


Ser 


Pro 


Lys 


Phe 


40 


1 








5 










10 










15 






Pro 


Leu 


Arg 


Asp 
20 


Glu 


Lys 


Met 


Lys 


Thr 

25 


He 


Leu 


Ser 


Ser 


Glu 
30 


Thr 


Met 




Asp 


He 


Pro 
35 


Asp 


Gly 


Val 


Ala 


He 

40 


Lys 


Val 


Asn 


Ala 


Lys 
45 


Val 


He 


Glu 


45 


Val 


Glu 
50 


Gly 


Pro 


Arg 


Gly 


Lys 
55 


Leu 


Thr 


Arg 


Asp 


Phe 
60 


Lys 


His 


Leu 


Asn 




Leu 


Asp 


Phe 


Gin 


Leu 


He 


Lys 


Asp 


Gin 


Val 


Thr 


Gly 


Lys 


Arg 


Gin 


Leu 




65 










70 










75 










80 




Lys 


He 


Asp 


Ser 


Trp 


Phe 


Gly 


Ser 


Arg 


Lys 


Thr 


Ser 


Ala 


Ser 


He 


Arg 


50 










85 










90 










95 






Thr 


Ala 


Leu 


Ser 
100 


His 


Val 


Asp 


Asn 


Leu 
105 


He 


Ala 


Gly 


Val 


Thr 
110 


Gin 


Gly 




Phe 


Leu 


Tyr 
115 


Arg 


Met 


Arg 


Phe 


Val 
120 


Tyr 


Ala 


His 


Phe 


Pro 
125 


He 


Asn 


Ala 


55 


Ser 


He 
130 


Asp 


Gly 


Asn 


Asn 


Lys 
135 


Ser 


He 


Glu 


He 


Arg 
140 


Asn 


Phe 


Leu 


Gly 




Glu 


Lys 


Lys 


Val 


Arg 


Lys 


Val 


Glu 


Met 


Leu 


Asp 


Gly 


Val 


Lys 


He 


Val 




145 










150 










155 










160 




Arg 


Ser 


Glu 


Lys 


Val 


Lys 


Asp 


Glu 


He 


He 


Leu 


Glu 


Gly 


Asn 


Asp 


He 


60 










165 










170 










175 






Glu 


Leu 


Val 


Ser 
180 


Arg 


Ser 


Cys 


Ala 


Leu 
185 


He 


Asn 


Gin 


Lys 


Cys 
190 


His 


Val 
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Lys Lys Lys Asp lie Arg Lys Phe Leu Asp Gly lie Tyr Val Ser Glu 

195 200 205 

Lys Gly Lys lie Ala Val Glu Glu 
210 215 
5 (2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino "acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 194 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 1025685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
Met Lys Thr lie Leu Ser Ser Glu Thr Met Asp lie Pro Asp Gly Val 
15 10 15 

Ala lie Lys Val Asn Ala Lys Val lie Glu Val Glu Gly Pro Arg Gly 
20 20 25 30 

Lys Leu Thr Arg Asp Phe Lys His Leu Asn Leu Asp Phe Gin Leu lie 

35 ' " 4 0 4 5 

Lys Asp Gin Val Thr Gly Lys Arg Gin Leu Lys lie Asp Ser Trp Phe 
50 55 60 

2 5 Gly Ser Arg Lys Thr Ser Ala Ser lie Arg Thr Ala Leu Ser His Val 
65 70 75 80 

Asp Asn Leu lie Ala Gly Val Thr Gin Gly Phe Leu Tyr Arg Met Arg 

85 90 95 

Phe Val Tyr Ala His Phe Pro lie Asn Ala Ser lie Asp Gly Asn Asn 
30 100 105 110 

Lys Ser lie Glu lie Arg Asn Phe Leu Gly Glu Lys Lys Val Arg Lys 

115 120 125 

Val Glu Met Leu Asp Gly Val Lys lie Val Arg Ser Glu Lys Val Lys 
130 135 140 

35 Asp Glu lie lie Leu Glu Gly Asn Asp lie Glu Leu Val Ser Arg Ser 
145 150 155 160 

Cys Ala Leu lie Asn Gin Lys Cys His Val Lys Lys Lys Asp lie Arg 

165 ' 170 * ~ 175 

Lys Phe Leu Asp Gly lie Tyr Val Ser Glu Lys Gly Lys lie Ala Val 
4 0 180 "* 185 " 190 

Glu Glu 

(2) INFORMATION FOR SEQ ID NO: 133: 
(i) SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..18 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 1025686 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
55 Met Asp lie Pro Asp Gly Val Ala lie Lys Val Asn Ala Lys Val lie 
15 10 15 

Glu Val Glu Gly Pro Arg Gly Lys Leu Thr Arg Asp Phe Lys His Leu 

20 25 30 

Asn Leu Asp Phe Gin Leu lie Lys Asp Gin Val Thr Gly Lys Arg Gin 
60 35 40 45 

Leu Lys lie Asp Ser Trp Phe Gly Ser Arg Lys Thr Ser Ala Ser lie 
50 ' 55 " ' 60 
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Arg 


Thr 


Ala 


Leu 


Ser 


His 


Val 


Asp 


As n 


Leu 


He 


Ala 


Glv 


Val 


Thr 


Gin 


65 










70 










75 










80 


Glv 


Phe 


Leu 


T vr 


Arg 


Met 


Arg 


Phe 


Val 


T vr 
j 


Ala 


His 


Phe 


Pro 


He 


Asn 










85 










90 










95 




Ala 


Ser 


He 


Asp 


Glv 


Asn 


Asn 


L vs 


Ser 


He 


Glu 


He 


Arg 


Asn 


Phe 


Leu 








100 










105 










110 






Gly 


Glu 


Lys 


Lys 


Val 


Arg 


Lys 


Val 


Glu 


Met 


Leu 


Asp 


Glv 


Val 


Lys 


He 






115 










120 










125 








Val 


Arg 


Ser 


Glu 


Lys 


Val 


Lys 


Asp 


Glu 


He 


He 


Leu 


Glu 


Gly 


Asn 


Asp 




130 










135 










140 










He 


Glu 


Leu 


Val 


Ser 


Arg 


Ser 


Cys 


Ala 


Leu 


He 


Asn 


Gin 


Lys 


Cys 


His 


145 










150 










155 










160 


Val 


Lys 


Lys 


Lys 


Asp 


He 


Arg 


Lys 


Phe 


Leu 


Asp 


Gly 


He 


Tyr 


Val 


Ser 










165 










170 










175 




Glu 


Lys 


Gly 


Lys 


He 


Ala 


Val 


Glu 


Glu 

















180 185 



(2) INFORMATION FOR SEQ ID NO: 134: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 555 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027152 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



ggttttgtca tcgtgctcct cttctcctcc accgcacaga tccaagacat tttgattaca 60 

ctccatcgcc ggcgaaaatg gtgacggaaa agaaaacgaa gaagtctcat gagggtatca 120 

acagtaggtt ggctcttgtt atgaagagtg gtaaatacac tcttggttac aagtctgttc 180 

tcaaatccct ccgtggctcc aaagggaaac tgsttctaat ctccaccaat tgcccaccgt 240 

tgagaagatc agagattgag tactacgcaa tgctcgctaa agttggtgtc caccattaca 300 

atggaaataa cgttgatttg ggaacagctt gcggaaagta cttccgtgtt tcttgtctga 360 

gcattgttga tcctggtgat tctgacatta tcaagtcaat tcctggtgac cagtgataca 420 

attttcagat tttatttttc ccagactttt tacccgtctt tgtattcgag tctctcaaaa 480 

ttttatgagc aatagacatt gtgtttcttc tttgtcaagt tttcttatat tgaggaccgg 540 



aaaagaaatt tcttt 

(2) INFORMATION FOR SEQ ID NO: 135: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..137 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027153 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 



Phe 


Cys 


His 


Arg 


Ala 


Pro 


Leu 


Leu 


Leu 


His 


Arg 


Thr 


Asp 


Pro 


Arg 


His 


1 








5 










10 










15 




Phe 


Asp 


Tyr 


Thr 
20 


Pro 


Ser 


Pro 


Ala 


Lys 
25 


Met 


Val 


Thr 


Glu 


Lys 
30 


Lys 


Thr 


Lys 


Lys 


Ser 
35 


His 


Glu 


Gly 


He 


Asn 
40 


Ser 


Arg 


Leu 


Ala 


Leu 
45 


Val 


Met 


Lys 


Ser 


Gly 
50 


Lys 


Tyr 


Thr 


Leu 


Gly 
55 


Tyr 


Lys 


Ser 


Val 


Leu 
60 


Lys 


Ser 


Leu 


Arg 


Gly 


Ser 


Lys 


Gly 


Lys 


Leu 


He 


Leu 


He 


Ser 


Thr 


Asn 


Cys 


Pro 


Pro 


Leu 


65 










70 










75 










80 


Arg 


Arg 


Ser 


Glu 


He 
85 


Glu 


Tyr 


Tyr 


Ala 


Met 
90 


Leu 


Ala 


Lys 


Val 


Gly 
95 


Val 
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His His Tyr Asn Gly Asn Asn Val Asp Leu Gly Thr Ala Cys Gly Lys 

100 105 ~ 110 

Tyr Phe Arg Val Ser Cys Leu Ser lie Val Asp Pro Gly Asp Ser Asp 

115 ' 120 125 

lie lie Lys Ser lie Pro Gly Asp Gin 

130 135 
(2) INFORMATION FOR SEQ ID NO: 136: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..112 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027154 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 



Met 


Val 


Thr 


Glu 


Lys 


Lys 


Thr 


Lys 


Lys 


Ser 


His 


Glu 


Gly 


lie 


Asn 


Ser 


1 








5 










10 










15 




Arg 


Leu 


Ala 


Leu 


Val 


Met 


Lys 


Ser 


Gly 


Lys 


Tyr 


Thr 


Leu 


Gly 


Tyr 


Lys 








20 










25 










30 






Ser 


Val 


Leu 


Lys 


Ser 


Leu 


Arg 


Gly 


Ser 


Lys 


Gly 


Lys 


Leu 


He 


Leu 


He 






35 










40 










45 








Ser 


Thr 


Asn 


Cys 


Pro 


Pro 


Leu 


Arg 


Arg 


Ser 


Glu 


He 


Glu 


Tyr 


Tyr 


Ala 




50 










55 










60 










Met 


Leu 


Ala 


Lys 


Val 


Gly 


Val 


His 


His 


Tyr 


Asn 


Gly 


Asn 


Asn 


Val 


Asp 


65 










70 










75 










80 


Leu 


Gly 


Thr 


Ala 


Cys 


Gly 


Lys 


Tyr 


Phe 


Arg 


Val 


Ser 


Cys 


Leu 


Ser 


He 










85 










90 










95 




Val 


Asp 


Pro 


Gly 


Asp 


Ser 


Asp 


He 


He 


Lys 


Ser 


He 


Pro 


Gly 


Asp 


Gin 








100 










105 










110 






(2) 


I N FORMAT I ON 


FOR 


SEQ 


id ; 


NO: 137: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..91 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027155 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 137 












Met 


Lys 


Ser Gly 


Lys 


Tyr 


Thr Leu 


Gly 


Tyr 


Lys 


Ser 


Val 


Leu 


Lys 


Ser 


1 






5 








10 










15 




Leu 


Arg 


Gly Ser 


Lys 


Gly 


Lys Leu 


lie 


Leu 


He 


Ser 


Thr 


Asn 


Cys 


Pro 






20 








25 










30 






Pro 


Leu 


Arg Arg 


Ser 


Glu 


lie Glu 


Tyr 


Tyr 


Ala 


Met 


Leu 


Ala 


Lys 


Val 






35 






40 










45 








Gly 


Val 


His His 


Tyr 


Asn 


Gly Asn 


Asn 


Val 


Asp 


Leu 


Gly 


Thr 


Ala 


Cys 




50 








55 








60 










Gly 


Lys 


Tyr Phe 


Arg 


Val 


Ser Cys 


Leu 


Ser 


He 


Val 


Asp 


Pro 


Gly 


Asp 


65 








70 








75 










80 


Ser 


Asp 


lie lie 


Lys 


Ser 


lie Pro 


Gly 


Asp 


Gin 


















85 








90 














(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 138 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(ii) 
(ix) 



(xi) 



389 

MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..653 

(D) OTHER INFORMATION: / Ceres Seq. ID 1028095 
SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
aaataaagtt tttatttatc tatctatcca attggagacg aatcatatcg agcgatggat 
ttggcagaac tgtgggcgat tttcggaccc ggattctccg gcgccgtttt cggaaccggg 
tggtggtttt gggtcgacgc cgtcgtttgc agttccatcc aagttccctt cgtccattac 
cttcccggca tattcgcttc tctcggagct ttgatgttca attgcgtcag aaaagaagac 
attgattact ctccttacga cgaaggcgag tggagattga agctgtggct tttcatagcg 
tatgtcgtag catttgtttc cttagctgct tctgttggct tgctgattca agattcggtt 
gtgaagactg ggccttcaac ttggactggt gtggctggtg tctttcaatg tgtatttgta 
ttgataagtg ggctaatgta ttggacatcg cactcagagt agcgcacagg cacatacttc 
atctccgact cagattcata ctgtgacaac ttacaagttc ctctcgtttc tctactatgt 
aaattcctgt gtcttgtctt gtcaccttgg cattgtgtgc gagattcctc tgttgcatca 
gtgtgtgttt ctgttaaaac tttgtgtgaa tgttattgaa aatactccaa ate 
(2) INFORMATION FOR SEQ ID NO: 139: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE 
FEATURE: 

(A) NAME /KEY 

(B) LOCATION 



peptide 
1. .135 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



ID 1028096 



Met 


Asp 


Leu 


Ala 


Glu 


Leu 


Trp 


Ala 


He 


Phe 


Gly 


Pro 


Gly 


Phe 


Ser 


Gly 


1 








5 










10 










15 




Ala 


Val 


Phe 


Gly 


Thr 


Gly 


Trp 


Trp 


Phe 


Trp 


Val 


Asp 


Ala 


Val 


Val 


Cys 








20 










25 










30 






Ser 


Ser 


He 


Gin 


Val 


Pro 


Phe 


Val 


His 


Tyr 


Leu 


Pro 


Gly 


He 


Phe 


Ala 






35 










40 










45 








Ser 


Leu 


Gly 


Ala 


Leu 


Met 


Phe 


Asn 


Cys 


Val 


Arg 


Lys 


Glu 


Asp 


He 


Asp 




50 










55 










60 










Tyr 


Ser 


Pro 


Tyr 


Asp 


Glu 


Gly 


Glu 


Trp 


Arg 


Leu 


Lys 


Leu 


Trp 


Leu 


Phe 


65 










70 










75 










80 


He 


Ala 


Tyr 


Val 


Val 


Ala 


Phe 


Val 


Ser 


Leu 


Ala 


Ala 


Ser 


Val 


Gly 


Leu 










85 










90 










95 




Leu 


He 


Gin 


Asp 


Ser 


Val 


Val 


Lys 


Thr 


Gly 


Pro 


Ser 


Thr 


Trp 


Thr 


Gly 








100 










105 










110 






Val 


Ala 


Gly 


Val 


Phe 


Gin 


Cys 


Val 


Phe 


Val 


Leu 


lie 


Ser 


Gly 


Leu 


Met 






115 










120 










125 








Tyr 


Trp 


Thr 


Ser 


His 


Ser 


Glu 






















130 










135 




















(2) 


INFORMATION 


FOR 


SEQ 


ID 1 


NO: 140: 

















(i) 



(ii) 
(ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



Met 
1 



peptide 
1. .82 

(D) OTHER INFORMATION: / Ceres Seq. ID 1028097 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

Phe Asn Cys Val Arg Lys Glu Asp He Asp Tyr Ser Pro Tyr Asp 
5 10 15 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Glu Gly Glu Trp Arg Leu Lys Leu Trp Leu Phe He Ala Tyr Val Val 
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20 








25 










30 






Ala 


Phe Val Ser 


Leu 


Ala 


Ala 


Ser Val 


Gly 


Leu 


Leu 


He 


Gin 


Asp 


Ser 




35 








4 0 








4 5 








Val 


Val Lys Thr 


Gly 


Pro 


Ser 


Thr Trp 


Thr 


Gly 


Val 


Ala 


Gly 


Val 


Phe 




50 






55 








60 










Gin 


Cys Val Phe 


Val 


Leu 


He 


Ser Gly 


Leu 


Met 


Tyr 


Trp 


Thr 


Ser 


His 


65 






70 








75 










80 


Ser 


Glu 
























(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO:141: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 



(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 80 

(D) OTHER INFORMATION: / Ceres Seq. ID 1028098 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO: 141: 










He 


Lys 


Phe Leu 


Phe He 


Tyr Leu Ser 


Asn 


Trp 


Arg 


Arg 


He 


He 


Ser 


1 






5 




10 










15 




Ser 


Asp 


Gly Phe 


Gly Arg 


Thr Val Gly 


Asp 


Phe 


Arg 


Thr 


Arg 


He 


Leu 






20 




25 










30 






Arg 


Arg 


Arg Phe 


Arg Asn 


Arg Val Val 


Val 


Leu 


Gly Arg Arg 


Arg 


Arg 






35 




40 








45 








Leu 


Gin 


Phe His 


Pro Ser 


Ser Leu Arg 


Pro 


Leu 


Pro 


Ser 


Arg 


His 


He 




50 






55 






60 










Arg 


Phe 


Ser Arg 


Ser Phe 


Asp Val Gin 


Leu 


Arg 


Gin 


Lys 


Arg 


Arg 


His 


65 






70 






75 










80 


(2) 


INFORMATION 


FOR SEQ 


ID NO-.142: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .779 

(D) OTHER INFORMATION: / Ceres Seq. ID 1028608 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



gtctaaactc taaactctaa agactttgtc atttcgtgaa ccgtcgtcat aactgatcct 60 

tccaaagaat ctccaaaatc cctaaacttc gaaaatggca gaaaccgccg agacaatcaa 120 

cacca.cqa.tc tcatcqccgc caccqgaatc cqaaagctcc accactattt ctgcaatgac 180 

agatccaaca tctcaagaag cagcctcaaa agacacagat ctgacgaaag aagccgaatc 240 

agagaagaaa ccaggaggaa tctctctccg aatctggcca ccgactcaga aaactcgcga 300 

cgccgttctg aatcgcctga tcgagacctt atccaccgaa tcaatcctct ctaagagata 360 

cggtactctt aaatctgacg atgcaaccac cgtcgcgaaa cttatcgaag aagaggctta 420 

tggtgttgct tcgaatgctg tgtcgagtga tgatgatggg attaagattc ttgagcttta 480 

ttctaaagag attagtaaga ggatgcttga atctgttaag gctagatcta atgctagtgt 540 

tggaaatgga agtgtggagg atgctaatac tgatgctagt gaggtttcta aagatgatgc 600 

tggtcctggt tcagaggagg agaagagtga ggcttgagaa atttggtttt acttccttat 660 

tataatctgt tgtctcttag ggttttgctg ttcttgtaat gatgatgatg acttgtggat 720 



ttggtattat gcatatttga agatgtttct tcttccttaa tatgagtgca agtgtcttc 
(2) INFORMATION FOR SEQ ID NO : 1 4 3 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..180 

5 (D) OTHER INFORMATION: / Ceres Seq. ID 1028609 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
Met Ala Glu Thr Ala Glu Thr lie Asn Thr Thr lie Ser Ser Pro Pro 
15 10 15 

Pro Glu Ser Glu Ser Ser Thr Thr lie Ser Ala Met Thr Asp Pro Thr 
10 20 25 30 

Ser Gin Glu Ala Ala Ser Lys Asp Thr Asp Leu Thr Lys Glu Ala Glu 

35 40 45 

Ser Glu Lys Lys Pro Gly Gly lie Ser Leu Arg lie Trp Pro Pro Thr 
50 55 60 

15 Gin Lys Thr Arg Asp Ala Val Leu Asn Arg Leu lie Glu Thr Leu Ser 
65 " 70 75 80 

Thr Glu Ser lie Leu Ser Lys Arg Tyr Gly Thr Leu Lys Ser Asp Asp 

85 90 95 

Ala Thr Thr Val Ala Lys Leu lie Glu Glu Glu Ala Tyr Gly Val Ala 
20 100 105 110 

Ser Asn Ala Val Ser Ser Asp Asp Asp Gly lie Lys lie Leu Glu Leu 

115 120 125 

Tyr Ser Lys Glu lie Ser Lys Arg Met Leu Glu Ser Val Lys Ala Arg 
130 135 140 

2 5 Ser Asn Ala Ser Val Gly Asn Gly Ser Val Glu Asp Ala Asn Thr Asp 
145 150 ~ 155 160 

Ala Ser Glu Val Ser Lys Asp Asp Ala Gly Pro Gly Ser Glu Glu Glu 
165 170 175 

Lys Ser Glu Ala 
30 ~ 180 

(2) INFORMATION FOR SEQ ID NO: 144: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
35 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
40 (B) LOCATION: 1..153 

(D) OTHER INFORMATION: / Ceres Seq. ID 1028610 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
Met Thr Asp Pro Thr Ser Gin Glu Ala Ala Ser Lys Asp Thr Asp Leu 
15 10 15 

4 5 Thr Lys Glu Ala Glu Ser Glu Lys Lys Pro Gly Gly lie Ser Leu Arg 

20 25 30 

lie Trp Pro Pro Thr Gin Lys Thr Arg Asp Ala Val Leu Asn Arg Leu 

35 40 45 

lie Glu Thr Leu Ser Thr Glu Ser lie Leu Ser Lys Arg Tyr Gly Thr 
50 50 55 60 

Leu Lys Ser Asp Asp Ala Thr Thr Val Ala Lys Leu lie Glu Glu Glu 
65 70 75 80 

Ala Tyr Gly Val Ala Ser Asn Ala Val Ser Ser Asp Asp Asp Gly lie 
85 90 95 

55 Lys lie Leu Glu Leu Tyr Ser Lys Glu lie Ser Lys Arg Met Leu Glu 

100 105 110 

Ser Val Lys Ala Arg Ser Asn Ala Ser Val Gly Asn Gly Ser Val Glu 

115 120 125 

Asp Ala Asn Thr Asp Ala Ser Glu Val Ser Lys Asp Asp Ala Gly Pro 
60 ' 130 135 140 

Gly Ser Glu Glu Glu Lys Ser Glu Ala 
145 150 
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(2) INFORMATION FOR SEQ ID NO: 145: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 94 

(D) OTHER INFORMATION: / Ceres Seq. ID 1028611 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



Met 


Gin 


Pro 


Pro 


Ser 


Arg 


Asn 


Leu 


Ser 


Lys 


Lys 


Arg 


Leu 


Met 


Val Leu 


1 








5 










10 










15 


Leu 


Arg 


Met 


Leu 


Cys 


Arg 


Val 


Met 


Met 


Met 


Gly 


Leu 


Arg 


Phe 


Leu Ser 








20 










25 










30 




Phe 


lie 


Leu 


Lys 


Arg 


Leu 


Val 


Arg 


Gly 


Cys 


Leu 


Asn 


Leu 


Leu 


Arg Leu 






35 










40 










45 






Asp 


Leu 


Met 


Leu 


Val 


Leu 


Glu 


Met 


Glu 


Val 


Trp 


Arg 


Met 


Leu 


lie Leu 




50 










55 










60 








Met 


Leu 


Val 


Arg 


Phe 


Leu 


Lys 


Met 


Met 


Leu 


Val 


Leu 


Val 


Gin 


Arg Arg 


65 










70 










75 








80 


Arg 


Arg 


Val 


Arg 


Leu 


Glu 


Lys 


Phe 


Gly 


Phe 


Thr 


Ser 


Leu 


Leu 





85 90 
(2) INFORMATION FOR SEQ ID NO: 146: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..673 

(D) OTHER INFORMATION: / Ceres Seq. ID 1030069 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 6: 



atccaaaaaa aacaaaacaa aaaattatat tcaagagaaa aaggaaaaaa tgaatttcat 60 

ctccgatcag gtaaagaaac tctcaagctc aacaccagag gagccagacc acaacaagcc 120 

agtcgaagga accgaaacag ctacaagacc agctaccaac gccgagctca tggcaagtgc 180 

caaggttgta gctgaagctg ctcaagccgc agctcgtaac gaatcagaca aactcgacaa 240 

gggtaaagtc gccggagcct ctgctgatat cttagacgct tccgagaaat acggtaagtt 300 

cgatgaaaag agtagcactg gtcactacct cgacaaggct gagaagtatc tcaacgacta 360 

cgagtcgtca cactccaccg gtgctggtgg tcctcctcct ccgacgagtc aggctgagcc 420 

agcaagtcag cctgagccgg cggctaagaa agacgatgaa gagtctggtg gtgggcttgg 480 

aggttatgcc aagatggctc aaggtttctt gaagtgattt gatctttaat tgttgttcat 540 

cattttcgta ataataaatt aaataactag tatcgtttgt gactagttta tgttgcttcg 600 

tttatgttta tggggagtga cgagtgagtg taataacttc tggtgatcat gaatctaatc 660 



catctttgtt gtg 

(2) INFORMATION FOR SEQ ID NO : 1 4 7 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1 . . 171 

(D) OTHER INFORMATION: / Ceres Seq. ID 1030070 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
Ser Lys Lys Asn Lys Thr Lys Asn Tyr lie Gin Glu Lys Lys Glu Lys 
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Met Asn Phe lie Ser Asp Gin Val Lys 

20 25 
Glu Glu Pro Asp His Asn Lys Pro Val 

35 ~ 40 

Arg Pro Ala Thr Asn Ala Glu Leu Met 

50 55 
Glu Ala Ala Gin Ala Ala Ala Arg Asn 
65 70 
Gly Lys Val Ala Gly Ala Ser Ala Asp 
85 

Tyr Gly Lys Phe Asp Glu Lys Ser Ser 
100 105 
Ala Glu Lys Tyr Leu Asn Asp Tyr Glu 

115 120 
Gly Gly Pro Pro Pro Pro Thr Ser Gin 

130 135 
Glu Pro Ala Ala Lys Lys Asp Asp Glu 
145 150 
Gly Tyr Ala Lys Met Ala Gin Gly Phe 
165 

(2) INFORMATION FOR SEQ ID NO: 14 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino aci 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..155 
(D) OTHER INFORMATION: / Ceres Seq. ID 1030071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 



Met 


Asn 


Phe 


He 


Ser 


Asp 


Gin 


Val 


Lys 


Lys 


Leu 


Ser 


Ser 


Ser 


Thr 


Pro 


1 








5 










10 










15 




Glu 


Glu 


Pro 


Asp 


His 


Asn 


Lys 


Pro 


Val 


Glu 


Gly 


Thr 


Glu 


Thr 


Ala 


Thr 








20 










25 










30 






Arg 


Pro 


Ala 


Thr 


Asn 


Ala 


Glu 


Leu 


Met 


Ala 


Ser 


Ala 


Lys 


Val 


Val 


Ala 






35 










40 










45 








Glu 


Ala 


Ala 


Gin 


Ala 


Ala 


Ala 


Arg 


Asn 


Glu 


Ser 


Asp 


Lys 


Leu 


Asp 


Lys 




50 










55 










60 










Gly 


Lys 


Val 


Ala 


Gly 


Ala 


Ser 


Ala 


Asp 


He 


Leu 


Asp 


Ala 


Ser 


Glu 


Lys 


65 










70 










75 










80 


Tyr 


Gly 


Lys 


Phe 


Asp 


Glu 


Lys 


Ser 


Ser 


Thr 


Gly 


His 


Tyr 


Leu 


Asp 


Lys 










85 










90 










95 




Ala 


Glu 


Lys 


Tyr 


Leu 


Asn 


Asp 


Tyr 


Glu 


Ser 


Ser 


His 


Ser 


Thr 


Gly Ala 








100 










105 










110 






Gly 


Gly 


Pro 


Pro 


Pro 


Pro 


Thr 


Ser 


Gin 


Ala 


Glu 


Pro 


Ala 


Ser 


Gin 


Pro 






115 










120 










125 








Glu 


Pro 


Ala 


Ala 


Lys 


Lys 


Asp 


Asp 


Glu 


Glu 


Ser 


Gly 


Gly 


Gly 


Leu 


Gly 




130 










135 










140 










Gly 


Tyr 


Ala 


Lys 


Met 


Ala 


Gin 


Gly 


Phe 


Leu 


Lys 













145 150 155 

(2) INFORMATION FOR SEQ ID NO: 149: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..115 



393 



Lys 


Leu 


Q £i -p~ 


Q -K- 


O -L 

30 


± I J.-L 


P t 


Glu 


Gly 


Thr 


Glu 
45 


Thr 


Ala 


Thr 




OCX 


Ala 
60 


Lys 


Val 


Val 


Ala 


Glu 


Ser 
75 


As p 


Lys 


Leu 


Asp 


Lys 
80 


Tip 
lie 


Leu 


Aqn 




JCl 


O-L Li 


L ys 


90 










95 




Thr 

1 1 1 -L 


m w 

uiy 


Ui e 

nio 


i yr 


Leu 
110 




Lys 


Ser 


Ser 


His 


Ser 
125 


Thr 


Gly 


Ala 


Ala 


Glu 


Pro 
140 


Ala 


Ser 


Gin 


Pro 


Glu 


Ser 
155 


Gly 


Gly 


Gly 


Leu 


Gly 
160 


Leu 


Lys 













170 



ds 
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(D) OTHER INFORMATION: / Ceres Seq. ID 1030072 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
Met Ala Ser Ala Lys Val Val Ala Glu Ala Ala Gin Ala Ala Ala Arg 
15 10 15 

Asn Glu Ser Asp Lys Leu Asp Lys Gly Lys Val Ala Gly Ala Ser Ala 

20 ^ 25 30 

Asp lie Leu Asp Ala Ser Glu Lys Tyr Gly Lys Phe Asp Glu Lys Ser 

35 40 45 

Ser Thr Gly His Tyr Leu Asp Lys Ala Glu Lys Tyr Leu Asn Asp Tyr 

50 55 60 

Glu Ser Ser His Ser Thr Gly Ala Gly Gly Pro Pro Pro Pro Thr Ser 
65 70 " 75 80 

Gin Ala Glu Pro Ala Ser Gin Pro Glu Pro Ala Ala Lys Lys Asp Asp 

85 90 95 

Glu Glu Ser Gly Gly Gly Leu Gly Gly Tyr Ala Lys Met Ala Gin Gly 
100 ' ~ 105 " " 110 

Phe Leu Lys 
115 

(2) INFORMATION FOR SEQ ID NO: 150: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
DNA (genomic) 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



1. . 944 

(D) OTHER INFORMATION: / Ceres Seq. ID 1032069 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

attgcgaatt ttaattgatc actctctctc tctatctctg ttcaaacatc tctcccaaag 60 

taggtagaag aagatgggga cgacgttaga tgtatcaaga gcagagctag cacttgtggt 120 

aatgtatttg aacaaagcag aggcaagaga taagttatgc agagctatac agtatggttc 180 

caagttcttg agtggtggac aacctggtac tgctcaaaat gttgacaaat ctactagctt 240 

agcaagaaaa gtcttccgtc ttttcaagtt tgtgaatgac ttgcatggtc ttatcagtcc 300 

tgtgcctaaa ggaactcctc ttcctcttgt tttacttgga aagtcgaaga acgcactttt 360 

atctacattc ttgttcctgg atcaaattgt ctggcttggg agatcaggaa tatataagaa 420 

caaagaacga gctgagttac ttggacgtat atctctcttc tgctggatgg gatcttctgt 480 

ctgcacaact ttagtcgagg ttggtgagat gggaaggctt tcttcatcaa tgaagaagat 540 

cgaaaaggga ctcaagaatg gaaacaagta tcaggatgag gattatcgtg ctaagctaaa 600 

aaaatcaaac gagaggtcac ttgctttgat caaatcagct atggacattg ttgtagcagc 660 

tggtcttctt cagttagctc caacgaagat cactcctcgt gtcaccggag cttttggatt 720 

catcacctcc atcatttctt gttaccagtt gcttccgaca cgccccaaga tcaaaacacc 780 

ctgaagtcaa tcgaggaagc tggtgttaag gagaaagtca tttcagaaag tttctgttct 840 

tcttattatt aaggaaattc actgctgtta taaaacatcg tataaaatac ttatttgcag 900 

gaattacggt tattataaat ttatgcctca gattcttgat acgc 
(2) INFORMATION FOR SEQ ID NO: 151: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..236 

(D) OTHER INFORMATION: / Ceres Seq. ID 1032070 
SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
Met Gly Thr Thr Leu Asp Val Ser Arg Ala Glu Leu Ala Leu Val Val 
15 10 15 

Met Tyr Leu Asn Lys Ala Glu Ala Arg Asp Lys Leu Cys Arg Ala lie 
20 ^ 25 30 
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Gin Tyr Gly Ser Lys Phe Leu Ser Gly Gly Gin Pro Gly Thr Ala Gin 

35 40 45 

Asn Val Asp Lys Ser Thr Ser Leu Ala Arg Lys Val Phe Arg Leu Phe 

50 55 * 60 

Lys Phe Val Asn Asp Leu His Gly Leu lie Ser Pro Val Pro Lys Gly 
65 70 " 75 80 

Thr Pro Leu Pro Leu Val Leu Leu Gly Lys Ser Lys Asn Ala Leu Leu 

85 " 90 ' 95 

Ser Thr Phe Leu Phe Leu Asp Gin lie Val Trp Leu Gly Arg Ser Gly 

100 " 105 ~ 110 

lie Tyr Lys Asn Lys Glu Arg Ala Glu Leu Leu Gly Arg lie Ser Leu 

115 120 125 

Phe Cys Trp Met Gly Ser Ser Val Cys Thr Thr Leu Val Glu Val Gly 

130 135 ' 140 

Glu Met Gly Arg Leu Ser Ser Ser Met Lys Lys lie Glu Lys Gly Leu 
145 150 155 160 

Lys Asn Gly Asn Lys Tyr Gin Asp Glu Asp Tyr Arg Ala Lys Leu Lys 

165 ' 170 175 

Lys Ser Asn Glu Arg Ser Leu Ala Leu lie Lys Ser Ala Met Asp lie 

180 185 190 

Val Val Ala Ala Gly Leu Leu Gin Leu Ala Pro Thr Lys lie Thr Pro 

195 200 205 

Arg Val Thr Gly Ala Phe Gly Phe lie Thr Ser lie lie Ser Cys Tyr 

210 215 220 

Gin Leu Leu Pro Thr Arg Pro Lys lie Lys Thr Pro 
225 230 ' 235 

(2) INFORMATION FOR SEQ ID NO: 152: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .220 

(D) OTHER INFORMATION: / Ceres Seq. ID 1032071 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
Met Tyr Leu Asn Lys Ala Glu Ala Arg Asp Lys Leu Cys Arg Ala lie 
15 10 "* 15 

Gin Tyr Gly Ser Lys Phe Leu Ser Gly Gly Gin Pro Gly Thr Ala Gin 

20 25 30 

Asn Val Asp Lys Ser Thr Ser Leu Ala Arg Lys Val Phe Ar_g Leu Phe 

35 4 0 " 4 5 

Lys Phe Val Asn Asp Leu His Gly Leu lie Ser Pro Val Pro Lys Gly 

50 55 60 

Thr Pro Leu Pro Leu Val Leu Leu Gly Lys Ser Lys Asn Ala Leu Leu 
65 70 " J 75 80 

Ser Thr Phe Leu Phe Leu Asp Gin lie Val Trp Leu Gly Arg Ser Gly 

85 90 95 

lie Tyr Lys Asn Lys Glu Arg Ala Glu Leu Leu Gly Arg lie Ser Leu 

100 105 110 

Phe Cys Trp Met Gly Ser Ser Val Cys Thr Thr Leu Val Glu Val Gly 

115 " 120 125 

Glu Met Gly Arg Leu Ser Ser Ser Met Lys Lys lie Glu Lys Gly Leu 

130 135 ~ 140 

Lys Asn Gly Asn Lys Tyr Gin Asp Glu Asp Tyr Arg Ala Lys Leu Lys 
145 150 155 160 

Lys Ser Asn Glu Arg Ser Leu Ala Leu lie Lys Ser Ala Met Asp lie 

165 170 175 

Val Val Ala Ala Gly Leu Leu Gin Leu Ala Pro Thr Lys lie Thr Pro 
180 ~ 185 190 
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Arg Val Thr Gly Ala Phe Gly Phe lie Thr Ser lie lie Ser Cys Tyr 

195 200 205 

Gin Leu Leu Pro Thr Arg Pro Lys lie Lys Thr Pro 
210 215 220 

INFORMATION FOR SEQ ID NO: 153: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
DNA (genomic) 



(2) 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



1. .769 

(D) OTHER INFORMATION: / Ceres Seq. ID 1033557 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
ggtatcgtcc gtagacgata gaggttccac actatgtaaa ggagaccgga gaaacaaagg 60 
aactgggttt ggagataaga agtgggaact tgaatggctt ccttatccac ttccgttgta 120 
gcttcggctt cgtcgcgttt atggaatcct gctgcctcca acggcaagat ttgcgttcct 180 
tctgcttcgc tttctctgcg tacgggttgt agacggagtt cgtcctctct cacttcttcc 240 
gcttcttcgc aattgcttca ttgctcgttt ctctcgtcgc ctgtttccct agcgtctcca 300 
ttttctggtt tgtccattgc gtttgatctc agcagtcaaa ctagtggact gaatggccag 360 
agacgcagag gccttgtggt tagagctgga aaagctgctc tgtgtcaaac taagagaagc 420 
agatcaagaa aatctctagc taggactcat ggtttccgta gaaggatgag gaccactagc 480 
ggtagagcaa ccataaagcg tcgacgtgcc aagggacgtt ggaacctctg tcccaagtcc 540 
aaccctagca gcggcaaacg ggcttgaagt ttgtttctct cttcactcta tctgtaatct 600 
tgtttcatgt gctttttttc caccagaaga tacaatgaga aacatggttc gttgagctct 660 
tttctgcgtt ttaactctct tgcatccttg attcgtttat aaatcaatgt tcttggaatt 720 
catcacatgt actactatac tacttgcttc attgaatttg ttactactc 
(2) INFORMATION FOR SEQ ID NO: 154: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



(D) TOPOLOGY: 
(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



linear 
peptide 



peptide 
1. . 157 

(D) OTHER INFORMATION: / Ceres Seq. ID 1033558 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 4: 
Met Ala Ser Leu Ser Thr Ser Val Val Ala Ser Ala Ser Ser Arg Leu 
15 10 15 

Trp Asn Pro Ala Ala Ser Asn Gly Lys lie Cys Val Pro Ser Ala Ser 

20 25 30 

Leu Ser Leu Arg Thr Gly Cys Arg Arg Ser Ser Ser Ser Leu Thr Ser 

35 " 4 0 4 5 

Ser Ala Ser Ser Gin Leu Leu His Cys Ser Phe Leu Ser Ser Pro Val 

50 55 60 

Ser Leu Ala Ser Pro Phe Ser Gly Leu Ser lie Ala Phe Asp Leu Ser 
65 70 75 80 

Ser Gin Thr Ser Gly Leu Asn Gly Gin Arg Arg Arg Gly Leu Val Val 

85 90 95 

Arg Ala Gly Lys Ala Ala Leu Cys Gin Thr Lys Arg Ser Arg Ser Arg 

100 105 110 

Lys Ser Leu Ala Arg Thr His Gly Phe Arg Arg Arg Met Arg Thr Thr 

115 ^ 120 125 

Ser Gly Arg Ala Thr lie Lys Arg Arg Arg Ala Lys Gly Arg Trp Asn 

130 135 140 

Leu Cys Pro Lys Ser Asn Pro Ser Ser Gly Lys Arg Ala 
145 " 150 155 

(2) INFORMATION FOR SEQ ID NO: 155: 



WO 00/40695 



PCT/US00/00466 



397 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 679 

(D) OTHER INFORMATION: / Ceres Seq. ID 1034688 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 : 



aaaaaaaaac aaaacaaaaa attatattca agagaaaaag gaaaaaatga atttcatctc 60 

cgatcaggta aagaaactct caagctcaac accagaggag ccagaccaca acaagccagt 120 

cgaaggaacc gaaacagcta caagaccagc taccaacgcc gagctcatgg caagtgccaa 180 

ggttgtagct gaagctgctc aagccgcagc tcgtaacgaa tcagacaaac tcgacaaggg 24 0 

taaagtcgcc ggagcctctg ctgatatctt agacgctgcc gagaaatacg gtaagttcga 300 

tgaaaagagt agcactggtc agtacctcga caaggctgag aagtatctca acgactacga 360 

gtcgtcacac tccaccggtg ctggtggtcc tcctcctcca acgagtcagg ctgagccagc 420 

aagtcagcct gagccggcgg ctaagaaaga cgatgaagag tctggtggtg ggcttggagg 480 

ttatgccaag atggctcaag gtttcttgaa gtgatttgat ctttaattgt tgttcatcat 540 

tttcgtaata ataaattaaa taactagtat cgtttgtgac tagtttatgt tgcttcgttt 600 

atgtttatgg ggagtgacga gtgagtgtaa taacttctgg tgatcatgaa tctaatccat 660 



ctttgttgtg attatgctc 

(2) INFORMATION FOR SEQ ID NO: 156: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..170 

(D) OTHER INFORMATION: / Ceres Seq. ID 1034689 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 



Lys 


Lys 


Asn 


Lys 


Thr 


Lys 


Asn 


Tyr 


He 


Gin 


Glu 


Lys 


Lys 


Glu 


Lys 


Met 


1 








5 










10 










15 




Asn 


Phe 


He 


Ser 


Asp 


Gin 


Val 


Lys 


Lys 


Leu 


Ser 


Ser 


Ser 


Thr 


Pro 


Glu 








20 










25 










30 






Glu 


Pro 


Asp 


His 


Asn 


Lys 


Pro 


Val 


Glu 


Gly 


Thr 


Glu 


Thr 


Ala 


Thr 


Arg 






35 










40 










45 








Pro 


Ala 


Thr 


Asn 


Ala 


Glu 


Leu 


Met 


Ala 


Ser 


Ala 


Lys 


Val 


Val 


Ala 


Glu 




50 










55 










60 










Ala 


Ala 


Gin 


Ala 


Ala 


Ala 


Arg 


Asn 


Glu 


Ser 


Asp 


Lys 


Leu 


Asp 


Lys 


Gly 


65 










70 










75 










80 


Lys 


Val 


Ala 


Gly 


Ala 


Ser 


Ala 


Asp 


He 


Leu 


Asp 


Ala 


Ala 


Glu 


Lys 


Tyr 










85 










90 










95 




Gly 


Lys 


Phe 


Asp 


Glu 


Lys 


Ser 


Ser 


Thr 


Gly 


Gin 


Tyr 


Leu 


Asp 


Lys 


Ala 








100 










105 










110 






Glu 


Lys 


Tyr 


Leu 


Asn 


Asp 


Tyr 


Glu 


Ser 


Ser 


His 


Ser 


Thr 


Gly 


Ala 


Gly 






115 










120 










125 








Gly 


Pro 


Pro 


Pro 


Pro 


Thr 


Ser 


Gin 


Ala 


Glu 


Pro 


Ala 


Ser 


Gin 


Pro 


Glu 




130 










135 










140 










Pro 


Ala 


Ala 


Lys 


Lys 


Asp 


Asp 


Glu 


Glu 


Ser 


Gly 


Gly 


Gly 


Leu 


Gly 


Gly 


145 










150 










155 










160 


Tyr 


Ala 


Lys 


Met 


Ala 


Gin 


Gly 


Phe 


Leu 


Lys 















165 170 



(2) INFORMATION FOR SEQ ID NO: 15 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
5 (B) LOCATION: 1..155 

(D) OTHER INFORMATION: / Ceres Seq. ID 1034690 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
Met Asn Phe lie Ser Asp Gin Val Lys Lys Leu Ser Ser Ser Thr Pro 
15 10 15 

10 Glu Glu Pro Asp His Asn Lys Pro Val Glu Gly Thr Glu Thr Ala Thr 

20 " 25 30 

Arg Pro Ala Thr Asn Ala Glu Leu Met Ala Ser Ala Lys Val Val Ala 

35 40 45 

Glu Ala Ala Gin Ala Ala Ala Arg Asn Glu Ser Asp Lys Leu Asp Lys 
15 50 55 60 

Gly Lys Val Ala Gly Ala Ser Ala Asp lie Leu Asp Ala Ala Glu Lys 
65 70 75 80 

Tyr Gly Lys Phe Asp Glu Lys Ser Ser Thr Gly Gin Tyr Leu Asp Lys 
85 90 ~ 95 

20 Ala Glu Lys Tyr Leu Asn Asp Tyr Glu Ser Ser His Ser Thr Gly Ala 

100 " 105 110 

Gly Gly Pro Pro Pro Pro Thr Ser Gin Ala Glu Pro Ala Ser Gin Pro 

115 120 125 

Glu Pro Ala Ala Lys Lys Asp Asp Glu Glu Ser Gly Gly Gly Leu Gly 
25 130 " 135 140 

Gly Tyr Ala Lys Met Ala Gin Gly Phe Leu Lys 
145 150 " 155 

(2) INFORMATION FOR SEQ ID NO: 158: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

35 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..115 

(D) OTHER INFORMATION: / Ceres Seq. ID 1034691 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 
4 0 Met Ala Ser Ala Lys Val Val Ala Glu Ala Ala Gin Ala Ala Ala Arg 
15 10 15 

Asn Glu Ser Asp Lys Leu Asp Lys Gly Lys Val Ala Gly Ala Ser Ala 

20 " 25 " 30 

Asp lie Leu Asp Ala Ala Glu Lys Tyr Gly Lys Phe Asp Glu Lys Ser 
4 5 35 4 0 4 5 

Ser Thr Gly Gin Tyr Leu Asp Lys Ala Glu Lys Tyr Leu Asn Asp Tyr 

50 ^ 55 " ~ 60 

Glu Ser Ser His Ser Thr Gly Ala Gly Gly Pro Pro Pro Pro Thr Ser 
65 7 0 7 5 8 0 

50 Gin Ala Glu Pro Ala Ser Gin Pro Glu Pro Ala Ala Lys Lys Asp Asp 

85 90 95 

Glu Glu Ser Gly Gly Gly Leu Gly Gly Tyr Ala Lys Met Ala Gin Gly 
100 " " 105 ^ 110 

Phe Leu Lys 
55 115 

(2) INFORMATION FOR SEQ ID NO: 159: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..471 

(D) OTHER INFORMATION: / Ceres Seq. ID 1035033 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

atataaaaaa aaaacttatt ttgttcattc ttttaagttt aaaaacagtc aattcattta 60 
tatcatctat gatgaagaaa ctcattcaag tctcgttcac tgttatgatt atcttcacca 120 
ttcttgtgct aggagtggtg gcaaatgagg ggctaggaaa accaaaaaaa caatgtaatg 180 
agattctaaa gcaatccaac tgtgttgctg cagagtgtga ctctatgtgt gtgaagaaga 240 

10 gggggaaagg agccggttac tgctctcctt ctaaaaagtg ttactgctat tatcattgcc 300 
cttaacgatt ttattattta caataataag agatttatga taaagtagta aaataatgaa 360 
tcgggtgttt atagcctaac gattgatata gttgtctcta tcaaataaac tatatagact 420 
aaaagtaata ataataatga caatcataac tcagaagttt ttgtttagtc c 
(2) INFORMATION FOR SEQ ID NO: 160: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
2 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..100 

(D) OTHER INFORMATION: / Ceres Seq. ID 1035034 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

lie Lys Lys Lys Leu lie Leu Phe lie Leu Leu Ser Leu Lys Thr Val 
1 "5 10 15 

Asn Ser Phe lie Ser Ser Met Met Lys Lys Leu lie Gin Val Ser Phe 
20 25 30 

30 Thr Val Met lie lie Phe Thr lie Leu Val Leu Gly Val Val Ala Asn 
35 40 45 

Glu Gly Leu Gly Lys Pro Lys Lys Gin Cys Asn Glu lie Leu Lys Gin 

50 ^ " 55 " ' 60 

Ser Asn Cys Val Ala Ala Glu Cys Asp Ser Met Cys Val Lys Lys Arg 
35 65 ~ 70 75 80 

Gly Lys Gly Ala Gly Tyr Cys Ser Pro Ser Lys Lys Cys Tyr Cys Tyr 
85 90 95 

Tyr His Cys Pro 
100 

4 0 (2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

4 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..78 

50 (D) OTHER INFORMATION: / Ceres Seq. ID 1035035 



55 



60 





(xi) 


SEQUENCE DESCRIPTION: SEQ 


ID NO:161: 








Met 


Met 


Lys Lys 


Leu 


He 


Gin Val Ser 


Phe 


Thr 


Val 


Met He 


He 


Phe 


1 






5 






10 








15 




Thr 


He 


Leu Val 


Leu 


Gly 


Val Val Ala 


Asn 


Glu 


Gly 


Leu Gly 


Lys 


Pro 






20 






25 








30 






Lys 


Lys 


Gin Cys 


Asn 


Glu 


He Leu Lys 


Gin 


Ser 


Asn 


Cys Val 


Ala 


Ala 






35 






4 0 








45 






Glu 


Cys 


Asp Ser 


Met 


Cys 


Val Lys Lys 


Arg 


Gly 


Lys 


Gly Ala 


Gly 


Tyr 




50 








55 






60 








Cys 


Ser 


Pro Ser 


Lys 


Lys 


Cys Tyr Cys 


Tyr 


Tyr 


His 


Cys Pro 






65 








70 






75 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO:162: 
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(xi) 


SEQUENCE DESCRIPTION: SEQ 


ID NO: 162 










Met 


Lys 


Lys Leu 


He 


Gin 


Val Ser Phe 


Thr 


Val 


Met He 


He 


Phe 


Thr 


1 






5 






10 








15 




He 


Leu 


Val Leu 


Gly 


Val 


Val Ala Asn 


Glu 


Gly 


Leu Gly 


Lys 


Pro 


Lys 






20 






25 








30 






Lys 


Gin 


Cys Asn 


Glu 


He 


Leu Lys Gin 


Ser 


Asn 


Cys Val 


Ala 


Ala 


Glu 






35 






40 






45 








Cys 


Asp 


Ser Met 


Cys 


Val 


Lys Lys Arg 


Gly 


Lys 


Gly Ala 


Gly 


Tyr 


Cys 




50 








55 






60 








Ser 


Pro 


Ser Lys 


Lys 


Cys 


Tyr Cys Tyr 


Tyr 


His 


Cys Pro 








65 








70 






75 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 163 : 















400 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..77 

10 (D) OTHER INFORMATION: / Ceres Seq. ID 1035036 



15 



20 

65 

(2) I 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

3 0 (A) NAME/KEY: - 

(B) LOCATION: 1 . . 985 

(D) OTHER INFORMATION: / Ceres Seq. ID 1035071 
(xi) SEQUENCE DESCRIPTION: SEQ T D NO: 163: 
taacgcactc tcttctccga atctagggtt tttgtctcta gcctcctcaa gctttgcaaa 60 
35 caatcatggc gacccagatc agcaagaaga gaaagttcgt agccgatggt gtgttttacg 120 
ctgaactcaa tgaggttcta accagagagc ttgctgagga tggttactct ggtgttgagg 180 
ttcgcgtcac tcctatgcgt actgagatca tcattagagc cactcgtact caaaacgttc 240 
tcggtgagaa gggaaggagg attagggaat tgacatctct tgtacaaaag agattcaaat 300 
ttcctcagga cagtgttgag ctttatgctg agaaggttgc taacagaggt ctttgtgcta 360 

4 0 ttgctcaggc tgagtctctt cgttacaagc ttcttggtgg tcttgctgtt cgtagggcgt 420 

gctatggtgt tttaaggttt gttatggaga gtggagctaa aggatgcgaa gtcattgtga 180 
gtggaaaact ccgtgctgca cgtgctaagt agatgaagtt caaggatggt tacatggttt 540 
cctcaggtca gccaaccaag gaatacattg atgctgcagt taggcatgtt cttcttagac 600 
agggtgtgct tggtctcaag gtgaagatca tgcttgactg ggaccccaag ggcaaacaag 660 

4 5 gaccgatgac accattgcct gatgttgtga tcatccatac accgaaagaa gatgatgtat 720 
acattgcacc tgctcaggtt gttactcaag ctgcttttgt accggaagct ccattaacca 780 
ccacagatta tccagcaatg ccagttgctt agagagcctt ttcgttttct ggtttaaaac 840 
ccaatgggga tggttctttt tttgggacag tactagtttt ctcatgttat tctaagttaa 900 
tacttaaagt tctggaggtt gaattacatc aaggatatga atgtatgttt ttcgttatct 960 

50 tttgtttaaa ttttatttat ctatt 

(2) INFORMATION FOR SEQ ID NO: 164: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 8 amino acids 

(B) TYPE: amino acid 
55 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
60 (B) LOCATION: 1..148 

(D) OTHER INFORMATION: / Ceres Seq. ID 1035072 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
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Met 


Ala 


Thr 


Gin 


lie 


Ser 


Lys 


Lys 


Arg 


Lys 


Phe 


Val 


Ala 


Asp 


Gly 


Val 


1 








5 










10 










15 




Phe 


Tyr 


Ala 


Glu 


Leu 


Asn 


Glu 


Val 


Leu 


Thr 


Arg 


Glu 


Leu 


Ala 


Glu 


Asp 








20 










25 










30 






Gly 


Tyr 


Ser 


Gly 


Val 


Glu 


Val 


Arg 


Val 


Thr 


Pro 


Met 


Arg 


Thr 


Glu 


He 






35 










40 










45 








He 


He 


Arg 


Ala 


Thr 


Arg 


Thr 


Gin 


Asn 


Val 


Leu 


Gly 


Glu 


Lys 


Gly Arg 




50 










55 










60 










Arg 


He 


Arg 


Glu 


Leu 


Thr 


Ser 


Leu 


Val 


Gin 


Lys 


Arg 


Phe 


Lys 


Phe 


Pro 


65 










70 










75 










80 


Gin 


Asp 


Ser 


Val 


Glu 


Leu 


Tyr 


Ala 


Glu 


Lys 


Val 


Ala 


Asn 


Arg 


Gly 


Leu 










85 










90 










95 




Cys 


Ala 


He 


Ala 


Gin 


Ala 


Glu 


Ser 


Leu 


Arg 


Tyr 


Lys 


Leu 


Leu 


Gly 


Gly 








100 










105 










110 






Leu 


Ala 


Val 


Arg 


Arg 


Ala 


Cys 


Tyr 


Gly 


Val 


Leu 


Ara 


Phe 


Val 


Met 


Glu 






115 










120 










125 








Ser 


Gly 


Ala 


Lys 


Gly 


Cys 


Glu 


Val 


He 


Val 


Ser 


Gly 


Lys 


Leu 


Arg 


Ala 




130 










135 










140 










Ala 


Arg 


Ala 


Lys 


























145 
































(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 165: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
30 (B) LOCATION: 1..105 

(D) OTHER INFORMATION: / Ceres Seq. ID 1035073 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 





Met 


Arg 


Thr 


Glu 


He 


He 


He 


Arg 


Ala 


Thr 


Arg 


Thr 


Gin 


Asn 


Val 


Leu 




1 








5 










10 










15 




35 


Gly 


Glu 


Lys 


Gly 
20 


Arg 


Arg 


He 


Arg 


Glu 
25 


Leu 


Thr 


Ser 


Leu 


Val 
30 


Gin 


Lys 




Arg 


Phe 


Lys 
35 


Phe 


Pro 


Gin 


Asp 


Ser 
40 


Val 


Glu 


Leu 


Tyr 


Ala 
45 


Glu 


Lys 


Val 




Ala 


Asn 


Arg 


Gly 


Leu 


Cys 


Ala 


He 


Ala 


Gin 


Ala 


Glu 


Ser 


Leu 


Arg 


Tyr 


40 




50 










55 










60 












Lys 


Leu 


Leu 


Gly 


Gly 


Leu 


Ala 


Val 


Arg 


Arg 


Ala 


Cys 


Tyr 


Gly 


Val 


Leu 




65 










70 










75 










80 




Arg 


Phe 


Val 


Met 


Glu 
85 


Ser 


Gly 


Ala 


Lys 


Gly 
90 


Cys 


Glu 


Val 


He 


Val 

95 


Ser 


45 


Gly 


Lys 


Leu 


Arg 
100 


Ala 


Ala 


Arg 


Ala 


Lys 
105 

















(2) INFORMATION FOR SEQ ID NO: 166: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

55 (A) NAME/KEY: peptide 

(B) LOCATION: 1..99 

(D) OTHER INFORMATION: / Ceres Seq. ID 1035074 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
Met Lys Phe Lys Asp Gly Tyr Met Val Ser Ser Gly Gin Pro Thr Lys 
60 1 5 10 15 

Glu Tyr He Asp Ala Ala Val Arg His Val Leu Leu Arg Gin Gly Val 
20 25 30 
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Leu Gly Leu Lys Val Lys lie Met Leu Asp Trp Asp Pro Lys Gly Lys 

35 * 40 45 

Gin Gly Pro Met Thr Pro Leu Pro Asp Val Val lie lie His Thr Pro 

50 55 ~ 60 

Lys Glu Asp Asp Val Tyr lie Ala Pro Ala Gin Val Val Thr Gin Ala 

65 70 75 80 

Ala Phe Val Pro Glu Ala Pro Leu Thr Thr Thr Asp Tyr Pro Ala Met 
85 90 95 

Pro Val Ala 



(2) INFORMATION FOR SEQ ID NO: 167: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
2 0 (B) LOCATION: 1..75 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1376587 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
atccgcagaa aaccctattg ctcccctttg ctcaagctca gtgcctcttt gcagcgaaag 60 
ctcaaccaac aagacgatga ttatctcaga gaacaatcgc agagagatct gcaaatacct 120 
2 5 tttcaaagag ggagtatgct ttgctaagaa ggatttcaat ctcccaaagc atccgttgat 180 
tgatgtacca aacctgcaag tgattaagct catgcagagt ttcaagtcca aggagtacgt 240 
tagggagaca tttgcgtgga tgcattatta ttggtttctg actaatgaag gaattgagtt 300 
cttgagaact tatcttaacc ttccttccga tgttgtccct gctactttga agaagtctgc 360 
taagcccggt ggtcgtccct ttggtggccc acctggtgat cgccaaagag gaccacctcg 420 
30 ctctgatgga gaccgtccca gatttggtga ccgtgatgga taccgtggag gcccacgagg 480 
tggtgatgag aagagtggag ctccagctga tttccagccg tctttccaag gaggtggtgg 54 0 

taggcctggt tttggccgtg gtgcaggcgg ttacagtgca gcagcaccat ctggttcagg 600 
gttcccttga aaaatttatt gtcatattgc gacgatggaa ggacagtttt gttttttgtt 660 
ctagttctgt ttgtgta^tg caaatctgga atctataatc tatctattac cttcaccttg 720 
35 gtttaaacga acaaatcctg tagtttcagg attagtgcc 
(2) INFORMATION FOR SEQ ID NO: 168: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 

(B) TYPE: amino acid 
4 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
45 (B) LOCATION: 1..202 

(D) OTHER INFORMATION: / Ceres Seq. ID 1376588 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 
Ser Ala Glu Asn Pro lie Ala Pro Leu Cys Ser Ser Ser Val Pro Leu 
15 10 15 

50 Cys Ser Glu Ser Ser Thr Asn Lys Thr Met lie lie Ser Glu Asn Asn 

20 25 30 

Arg Arg Glu lie Cys Lys Tyr Leu Phe Lys Glu Gly Val Cys Phe Ala 

35 ' 4 0 ^ 4 5 

Lys Lys Asp Phe Asn Leu Pro Lys His Pro Leu lie Asp Val Pro Asn 
55 50 55 60 

Leu Gin Val lie Lys Leu Met Gin Ser Phe Lys Ser Lys Glu Tyr Val 
65 ~ 70 75 80 

Arg Glu Thr Phe Ala Trp Met His Tyr Tyr Trp Phe Leu Thr Asn Glu 
85 " 90 95 

60 Gly lie Glu Phe Leu Arg Thr Tyr Leu Asn Leu Pro Ser Asp Val Val 

100 105 110 

Pro Ala Thr Leu Lys Lys Ser Ala Lys Pro Gly Gly Arg Pro Phe Gly 
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115 










120 










125 








Gly 


Pro 
130 


Pro 


Gly 


Asp 


Arg 


Gin 
135 


Arg 


Gly 


Pro 


Pro 


Arg 
140 


Ser 


Asp 


Gly 


Asp 


Arg 


Pro 


Arg 


Phe 


Gly 


Asp 


Arg 


Asp 


Gly 


Tyr 


Arg 


Gly 


Gly 


Pro 


Arg 


Gly 


145 










150 










155 










160 


Gly 


Asp 


Glu 


Lys 


Ser 
165 


Gly 


Ala 


Pro 


Ala 


Asp 
170 


Phe 


Gin 


Pro 


Ser 


Phe 
175 


Gin 


Gly 


Gly 


Gly 


Gly 
180 


Arg 


Pro 


Gly 


Phe 


Gly 
185 


Arg 


Gly 


Ala 


Gly 


Gly 
190 


Tyr 


Ser 


Ala 


Ala 


Ala 
195 


Pro 


Ser 


Gly 


Ser 


Gly 
200 


Phe 


Pro 















(2) INFORMATION FOR SEQ ID NO: 169: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

20 (A) NAME/KEY: peptide 

(B) LOCATION: 1..177 

(D) OTHER INFORMATION: / Ceres Seq. ID 1376589 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 





Met 


He 


He 


Ser 


Glu 


Asn 


Asn 


Arg 


Arg 


Glu 


He 


Cys 


Lys 


Tyr 


Leu 


Phe 


25 


1 








5 










10 










15 






Lys 


Glu 


Gly 


Val 

20 


Cys 


Phe 


Ala 


Lys 


Lys 
25 


Asp 


Phe 


Asn 


Leu 


Pro 

30 


Lys 


His 




Pro 


Leu 


He 
35 


Asp 


Val 


Pro 


Asn 


Leu 
40 


Gin 


Val 


He 


Lys 


Leu 
45 


Met 


Gin 


Ser 


30 


Phe 


Lys 
50 


Ser 


Lys 


Glu 


Tyr 


Val 
55 


Arg 


Glu 


Thr 


Phe 


Ala 
60 


Trp 


Met 


His 


Tyr 




Tyr 


Trp 


Phe 


Leu 


Thr 


Asn 


Glu 


Gly 


He 


Glu 


Phe 


Leu 


Arg 


Thr 


Tyr 


Leu 




65 










70 










75 










80 




Asn 


Leu 


Pro 


L'er 


Asp 


Val 


Val 


Pro 


Ala 


Thr 


Leu 


Lys 


Lys 


Sex 


Ala 


Lys 


35 










85 










90 










95 






Pro 


Gly 


Gly 


Arg 
100 


Pro 


Phe 


Gly 


Gly 


Pro 
105 


Pro 


Gly 


Asp 


Arg 


Gin 
110 


Arg 


Gly 




Pro 


Pro 


Arg 
115 


Ser 


Asp 


Gly 


Asp 


Arg 
120 


Pro 


Arg 


Phe 


Gly 


Asp 
125 


Arg 


Asp 


Gly 


40 


Tyr 


Arg 
130 


Gly 


Gly 


Pro 


Arg 


Gly 
135 


Gly 


Asp 


Glu 


Lys 


Ser 
140 


Gly 


Ala 


Pro 


Ala 




Asp 


Phe 


Gin 


Pro 


Ser 


Phe 


Gin 


Gly 


Gly 


Gly 


Gly 


Arg 


Pro 


Gly 


Phe 


Gly 




145 










150 










155 










160 




Arg 


Gly 


Ala 


Gly 


Gly 


Tyr 


Ser 


Ala 


Ala 


Ala 


Pro 


Ser 


Gly 


Ser 


Gly 


Phe 


45 










165 










170 










175 





Pro 

(2) INFORMATION FOR SEQ ID NO: 170: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

55 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..132 

(D) OTHER INFORMATION: / Ceres Seq. ID 1376590 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
60 Met Gin Ser Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp 
1 5 10 • 15 

Met His Tyr Tyr Trp Phe Leu Thr Asn Glu Gly lie Glu Phe Leu Arg 
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20 25 30 

Thr Tyr Leu Asn Leu Pro Ser Asp Val Val Pro Ala Thr Leu Lys Lys 

35 40 45 

Ser Ala Lys Pro Gly Gly Arg Pro Phe Gly Gly Pro Pro Gly Asp Arg 
5 50 55 60 

Gin Arg Gly Pro Pro Arg Ser Asp Gly Asp Arg Pro Arg Phe Gly Asp 
65 70 75 80 

Arg Asp Gly Tyr Arg Gly Gly Pro Arg Gly Gly Asp Glu Lys Ser Gly 
85 90 95 

10 Ala Pro Ala Asp Phe Gin Pro Ser Phe Gin Gly Gly Gly Gly Arg Pro 

100 105 110 

Gly Phe Gly Arg Gly Ala Gly Gly Tyr Ser Ala Ala Ala Pro Ser Gly 

115 120 125 

Ser Gly Phe Pro 
15 . 130 

(2) INFORMATION FOR SEQ ID NO: 171: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
2 5 (B) LOCATION: 1..64 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 1378581 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
gatttcccat tttggttgca gcgatcggag agctctga< 
ttcgcaaatc caaataagct gtggatgaaa cgaagaat- 
30 ccatttatag cgggggttgc agtagctgca acagcact' 
gcatggcaag cattcaaggc aaggccacca aggcccaa* 



35 



gtagcggcgg aaaaggtgaa 
gcaggtggta gccatttcct 



gcaccagcca 


cctctgcggc 


60 


tagatttctg 


aatggcgaca 


120 


ctggtagata 


tggaatccaa 


180 


tt aagaaatt 


ttatgagggc 


240 


tt-cttggcgt 


cagggagagt 


300 


tge "agcaaa 


ccatccagat 


360 


ctaaagacgt 


gatgctaggc 


420 


aatcaaaaga 


cttccttatg 


480 


ttttatgtta 


atttataagc 


540 


tttcacttgt 


tgtcgctgaa 


600 


agtcg 







aatcatctta tgttatagga ctgtaatgtc aaaatgcaga 
ttgtggtctg tttgtgacac ataaagactc gaatcttttc 
aagaacacaa aaataaaaac gcaaagcaga agggcgttct 
(2) INFORMATION FOR SEQ ID NO: 172: 
4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 112 

(D) OTHER INFORMATION: / Ceres Seq. ID 1378582 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Met Ala Thr Pro Phe lie Ala Gly Val Ala Val Ala Ala Thr Ala Leu 
15 10 15 

Ala Gly Arg Tyr Gly lie Gin Ala Trp Gin Ala Phe Lys Ala Arg Pro 
20 25 30 

55 Pro Arg Pro Lys lie Lys Lys Phe Tyr Glu Gly Gly Phe Gin Pro Thr 
35 40 45 

Met Thr Lys Arg Glu Ala Ala Leu lie Leu Gly Val Arg Glu Ser Val 

50 55 60 

Ala Ala Glu Lys Val Lys Glu Ala His Arg Lys Val Met Val Ala Asn 
60 65 70 75 80 

His Pro Asp Ala Gly Gly Ser His Phe Leu Ala Ser Lys lie Asn Glu 
85 90 95 
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Met 


Thr 


Lys 


Arg 


Glu 


Ala 


Ala 


Leu 


He 


Leu 


Gly 


Val 


Arg 


Glu 


Ser 


Val 


1 








5 










10 










15 




Ala 


Ala 


Glu 


Lys 
20 


Val 


Lys 


Glu 


Ala 


His 
25 


Arg 


Lys 


Val 


Met 


Val 

30 


Ala 


Asn 


His 


Pro 


Asp 
35 


Ala 


Gly 


Gly 


Ser 


His 
40 


Phe 


Leu 


Ala 


Ser 


Lys 
45 


He 


Asn 


Glu 


Ala 


Lys 
50 


Asp 


Val 


Met 


Leu 


Gly 
55 


Lys 


Thr 


Lys 


Asn 


Ser 
60 


Gly 


Ser 


Ala 


Phe 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO:174: 
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Ala Lys Asp Val Met Leu Gly Lys Thr Lys Asn Ser Gly Ser Ala Phe 

100 105 ~ 110 

(2) INFORMATION FOR SEQ ID NO: 17 3: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..64 

(D) OTHER INFORMATION: / Ceres Seq. ID 1378583 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

15 



20 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 803 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1 . . 803 

(D) OTHER INFORMATION: / Ceres Seq. ID 1383462 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

35 aggacacgtt tctatctaga gagaacacaa cacatcgaac catggagaaa tcaacaagaa 60 
ccctattcat aaccatcgtg ataacctcca tgttgctagg gtttggaaac tctgatcttg 120 
ctcaggacag agaggagtgt acgaaccagc tcatagaact atccacgtgt attccgtacg 180 
ttggaggaga cgccaaggct ccaacaaaag attgttgtgc agggtttggc caagttataa 240 
gaaagagtga gaagtgtgtt tgcatattgg tcagagacaa agatgatcct caacttggta 300 

40 tcaagattaa cgcaacccta gccgctcatc ttccctccgc ttgtcatatc acggctccta 360 
acatcactga ctgtatttcg attctgcatc tacctcgtaa ctcaacattg gctaaagagt 420 
ttgagaactt aggaaggatt gaagacaact acaactccac atctcctaca caaattcata 480 
aagatgggac aggaggagga aaagccgaac cagtgaagag taatggatgg aaggagaaga 540 
gttggttggg tgttgagctt ttaatatatt tgctagtttc tctcatcttc ttttaattac 600 

45 ttcctttctt taatgttcct ttgctttatt attacaataa cttataagat atctatttat 660 
atatgagaca catgcatgtg tttgcgatct atatgcatgt aaatgagtca ctggttaatt 720 
tttaatcacg aagcgtacat tttaatttcg ttgttgccaa ttgatcttgt acgttagtga 780 
attttatgga ttatcgactt act 
(2) INFORMATION FOR SEQ ID NO: 175: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
55 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 197 

(D) OTHER INFORMATION: / Ceres Seq. ID 1383463 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

Asp Thr Phe Leu Ser Arg Glu Asn Thr Thr His Arg Thr Met Glu Lys 
15 10 15 
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Ser 


Thr 


Arg 


Thr 
20 


Leu 


Phe 


He 


Thr 


He 
25 


Val 


He 


Thr 


Ser 


Met 
30 


Leu 


Leu 


Gly 


Phe 


Gly 
35 


Asn 


Ser 


Asp 


Leu 


Ala 
40 


Gin 


Asp 


Arg 


Glu 


Glu 
45 


Cys 


Thr 


Asn 


Gin 


Leu 
50 


He 


Glu 


Leu 


Ser 


Thr 
55 


Cys 


He 


Pro 


Tyr 


Val 
60 


Gly 


Gly 


Asp 


Ala 


Lys 


Ala 


Pro 


Thr 


Lys 


Asp 


Cys 


Cys 


Ala 


Gly 


Phe 


Gly 


Gin 


Val 


He 


Arg 


65 










70 










75 










80 


Lys 


Ser 


Glu 


Lys 


Cys 
85 


Val 


Cys 


He 


Leu 


Val 
90 


Arg 


Asp 


Lys 


Asp 


Asp 
95 


Pro 


Gin 


Leu 


Gly 


He 
100 


Lys 


He 


Asn 


Ala 


Thr 
105 


Leu 


Ala 


Ala 


His 


Leu 
110 


Pro 


Ser 


Ala 


Cys 


His 
115 


He 


Thr 


Ala 


Pro 


Asn 
120 


He 


Thr 


Asp 


Cys 


He 
125 


Ser 


He 


Leu 


His 


Leu 
130 


Pro 


Arg 


Asn 


Ser 


Thr 
135 


Leu 


Ala 


Lys 


Glu 


Phe 
140 


Glu 


Asn 


Leu 


Gly 


Arg 


He 


Glu 


Asp 


Asn 


Tyr 


Asn 


Ser 


Thr 


Ser 


Pro 


Thr 


Gin 


He 


His 


Lys 


145 










150 










155 










160 


Asp 


Gly 


Thr 


Gly 


Gly 
165 


Gly 


Lys 


Ala 


Glu 


Pro 
170 


Val 


Lys 


Ser 


Asn 


Gly 
175 


Trp 


Lys 


Glu 


Lys 


Ser 
180 


Trp 


Leu 


Gly 


Val 


Glu 
185 


Leu 


Leu 


He 


Tyr 


Leu 
190 


Leu 


Val 


Ser 


Leu 


He 
195 


Phe 


Phe 

























(2) INFORMATION FOR SEQ ID NO: 17 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptic.3 

(B) LOCATION: 1..18 4 

(D) OTHER INFORMATION: / Ceres Seq. ID 1383464 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



Met 


Glu 


Lys 


Ser 


Thr 


Arg 


Thr 


Leu 


Phe 


He 


Thr 


He 


Val 


He 


Thr 


Ser 


1 








5 










10 










15 




Met 


Leu 


Leu 


Gly 


Phe 


Gly 


Asn 


Ser 


Asp 


Leu 


Ala 


Gin 


Asp 


Arg 


Glu 


Glu 








20 










25 










30 






Cys 


Thr 


Asn 


Gin 


Leu 


He 


Glu 


Leu 


Ser 


Thr 


Cys 


He 


Pro 


Tyr 


Val 


Gly 






35 










40 










45 








Gly 


Asp 


Ala 


Lys 


Ala 


Pro 


Thr 


Lys 


Asp 


Cys 


Cys 


Ala 


Gly 


Phe 


Gly 


Gin 




50 










55 










60 










Val 


He 


Arg 


Lys 


Ser 


Glu 


Lys 


Cys 


Val 


Cys 


He 


Leu 


Val 


Arg 


Asp 


Lys 


65 










70 










75 










80 


Asp 


Asp 


Pro 


Gin 


Leu 


Gly 


He 


Lys 


He 


Asn 


Ala 


Thr 


Leu 


Ala 


Ala 


His 










85 










90 










95 




Leu 


Pro 


Ser 


Ala 


Cys 


His 


He 


Thr 


Ala 


Pro 


Asn 


He 


Thr 


Asp 


Cys 


He 








100 










105 










110 






Ser 


He 


Leu 


His 


Leu 


Pro 


Arg 


Asn 


Ser 


Thr 


Leu 


Ala 


Lys 


Glu 


Phe 


Glu 






115 










120 










125 








Asn 


Leu 


Gly 


Arg 


He 


Glu 


Asp 


Asn 


Tyr 


Asn 


Ser 


Thr 


Ser 


Pro 


Thr 


Gin 




130 










135 










140 










He 


His 


Lys 


Asp 


Gly 


Thr 


Gly 


Gly 


Gly 


Lys 


Ala 


Glu 


Pro 


Val 


Lys 


Ser 


145 










150 










155 










160 


Asn 


Gly 


Trp 


Lys 


Glu 


Lys 


Ser 


Trp 


Leu 


Gly 


Val 


Glu 


Leu 


Leu 


He 


Tyr 










165 










170 










175 




Leu 


Leu 


Val 


Ser 


Leu 


He 


Phe 


Phe 



















180 



(2) INFORMATION FOR SEQ ID NO: 177: 
(i) SEQUENCE CHARACTERISTICS: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(ii) 
(ix) 



407 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..168 
(D) OTHER INFORMATION: 



/ Ceres Seq. ID 1383465 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
Met Leu Leu Gly Phe Gly Asn Ser Asp Leu Ala Gin Asp Arg Glu Glu 
15 10 15 

Cys Thr Asn Gin Leu lie Glu Leu Ser Thr Cys lie Pro Tyr Val Gly 

20 25 30 

Gly Asp Ala Lys Ala Pro Thr Lys Asp Cys Cys Ala Gly Phe Gly Gin 

35 40 45 

Val lie Arg Lys Ser Glu Lys Cys Val Cys lie Leu Val Arg Asp Lys 

50 55 60 

Asp Asp Pro Gin Leu Gly lie Lys lie Asn Ala Thr Leu Ala Ala His 
65 70 75 80 

Leu Pro Ser Ala Cys His lie Thr Ala Pro Asn lie Thr Asp Cys lie 

85 90 95 

Ser lie Leu His Leu Pro Arg Asn Ser Thr Leu Ala Lys Glu Phe Glu 

100 105 110 

Asn Leu Gly Arg lie Glu Asp Asn Tyr Asn Ser Thr Ser Pro Thr Gin 

115 ^ 120 125 

lie His Lys Asp Gly Thr Gly Gly Gly Lys Ala Glu Pro Val Lys Ser 

130 135 140 

Asn Gly Trp Lys Glu Lys Ser Trp Leu Gly Val Glu Leu Leu lie Tyr 
145 150 155 160 

Leu Leu Val Ser Leu lie Phe Phe 
165 

(2) INFORMATION FOR SEQ ID NO: 178: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 968 

(D) OTHER INFORMATION: / Ceres Seq. ID 1386215 
SEQUENCE DESCRIPTION: SEQ ID NO: 178: 
aagaggaaaa caaaactgca aaaacaaaaa taaaaaaaac atcgcacaag aaaataaaag 
atttgtagaa tcaactaaga aaatggctag cactatgatg actacattgc ctcagttcaa 
tggtcttcga gccaccaaaa tctctgcagc tcctgtacaa ggcctggcaa gtgttcagcc 
catgagacgc aagggaaatg gagctttggg tgcaaagtgt gacttcatcg gttcatcaac 
aaatctgata atggtaacgt cgacgaccct gatgttgttc gcggggagat tcggacttgc 
gccatcagcc aataggaagg cgacagctgg acttaggttg gaggcacgtg actcaggtct 
acaaacgggt gacccggccg ggttcacgct tgcggacact ttggcttgtg cctccaatct 
aaccccaatt tcaccttctt ccccaaatct cgagtcaatg aagacgtccc cgatgaaacg 
cctctcactc cgaaatcggc aatggctctt gtcgcagctg gaaatatctc ggtggctaat 
atgaataaag cggtggctct gttaaaggaa gaagctctta agaagatcat agcagctaaa 
acggccaagt tgagagctaa aggtgcattg acgaatcttc aggatattgt tattcgacag 
agtaaggtca ccggcaagag gaaggaagat gaacgctgat tttctaaatc ttgatagcaa 
ttttactggt acgtagagct tttgtgtgat atatcgcttc tagggttcaa ggaaagacta 
cttttttgcg gttattggcc atttatcttg gtgaatttta gttcatattc agacgtgttt 
gatcaaatgt ctcccagaga aagctggaga ctttgaatta ggatacatta gggcttttat 
tttgtacaat ggtgtgtgta agactcagtg acattcattc atattaatcg gttccatctt 
caat cttt 

(2) INFORMATION FOR SEQ ID NO: 17 9: 



(ii) 
(ix) 



(xi) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..180 

10 (D) OTHER INFORMATION: / Ceres Seq. ID 1386216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 9; 
Arg Gly Lys Gin Asn Cys Lys Asn Lys Asn Lys Lys Asn lie Ala Gin 
1 5 ^ 10 15 

Glu Asn Lys Arg Phe Val Glu Ser Thr Lys Lys Met Ala Ser Thr Met 
15 ~ 20 25 " 30 

Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg Ala Thr Lys lie Ser 

35 40 45 

Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin Pro Met Arg Arg Lys 
50 ^55 60 

20 Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe lie Gly Ser Ser Thr 
65 70 75 80 

Asn Leu lie Met Val Thr Ser Thr Thr Leu Met Leu Phe Ala Gly Arg 

85 90 95 

Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala Thr Ala Gly Leu Arg 
25 100 105 110 

Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly Asp Pro Ala Gly Phe 

115 120 125 

Thr Leu Ala Asp Thr Leu Ala Cys Ala Ser Asn Leu Thr Pro lie Ser 
130 135 140 

30 Pro Ser Ser Pro Asn Leu Glu Ser Met Lys Thr Ser Pro Met Lys Arg 
145 150 155 160 

Leu Ser Leu Arg Asn Arg Gin Trp Leu Leu Ser Gin Leu Glu lie Ser 
165 " 170 175 

Arg Trp Leu lie 
35 180 

(2) INFORMATION FOR SEQ ID NO: 180: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
40 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
45 (B) LOCATION: 1..153 

(D) OTHER INFORMATION: / Ceres Seq. ID 1386217 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
Met Ala Ser Thr Met Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg 
15 10 15 

50 Ala Thr Lys lie Ser Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin 

20 25 30 

Pro Met Arg Arg Lys Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe 

35 4 0 "* 4 5 

lie Gly Ser Ser Thr Asn Leu lie Met Val Thr Ser Thr Thr Leu Met 
55 50 55 60 

Leu Phe Ala Gly Arg Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala 
65 70 75 80 

Thr Ala Gly Leu Arg Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly 
85 90 95 

60 Asp Pro Ala Gly Phe Thr Leu Ala Asp Thr Leu Ala Cys Ala Ser Asn 

100 105 " 110 

Leu Thr Pro lie Ser Pro Ser Ser Pro Asn Leu Glu Ser Met Lys Thr 
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115 120 125 

Ser Pro Met Lys Arg Leu Ser Leu Arg Asn Arg Gin Trp Leu Leu Ser 

130 135 140 

Gin Leu Glu lie Ser Arg Trp Leu lie 
5 145 150 

(2) INFORMATION FOR SEQ ID NO; 181: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
15 (B) LOCATION: 1..149 

<D) OTHER INFORMATION: / Ceres Seq. ID 1386218 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
Met Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg Ala Thr Lys lie 
15 10 15 

20 Ser Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin Pro Met Arg Arg 

20 25 30 

Lys Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe lie Gly Ser Ser 

35 40 45 

Thr Asn Leu lie Met Val Thr Ser Thr Thr Leu Met Leu Phe Ala Gly 
25 50 55 60 

Arg Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala Thr Ala Gly Leu 
65 70 75 80 

Arg Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly Asp Pro Ala Gly 
8 5 90 95 

30 Phe Thr Leu Ala Asp Thr Leu Ala Cys Ala Ser Asn Leu Thr Pro lie 

100 105 110 

Ser Pro Ser Ser Pro Asn Leu Glu Ser Met Lys Thr Ser Pro Met Lys 

115 120 125 

Arg Leu Ser Leu Arg Asn Arg Gin Trp Leu Leu Ser Gin Leu Glu lie 
35 130 135 140 

Ser Arg Trp Leu lie 
145 

(2) INFORMATION FOR SEQ ID NO: 18 2: 
(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025156 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
50 Met Ala Leu Val Ala Ala Gly Asn lie Ser Val Ala Asn Met Asn Lys 
15 10 15 

Ala Val Ala Leu Leu Lys Glu Glu Ala Leu Lys Lys lie lie Ala Ala 

20 25 30 

Lys Thr Ala Lys Leu Arg Ala Lys Gly Ala Leu Thr Asn Leu Gin Asp 
55 35 40 45 

lie Val lie Arg Gin Ser Lys Val Thr Gly Lys Arg Lys Glu Asp Glu 
50 55 60 

Arg 
65 

60 (2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
5 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..770 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388499 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
10 aaaaaactca ttctcataac aattccctta actctatctc taggtgactt aagagtaaaa 60 
aatggtagag gcagaggttg aagtagatgt ggaaatcaaa tctacggctg ataagttttt 120 
catg'ttctct agaagatcac aacatgcctc caaagccact cgctatgtac agggatgtga 180 
tctgctcgaa ggcgagtggg gcgaagttgg aagcatactc ttgtggaaat taactgttga 240 
tggagagcca aaagtgtcaa aggatatgat tgaggcgatt gatatgaaga tgaatatgat 300 
15 ccagtggagg gtgttagagg gacctctgaa ggaagagtac aatattttct cgaaaacgat 360 
gaaagtaagc cctaagcaag gagggtctgg aagtgtggtg aagtggaact tgaagtatga 420 
gagaattgat gaaaaggtgg ctcacctaga gaggctcctt cagttcttcg tcgaatgtgt 480 
taacgagatt gaccaatacc tattgtctga gggttagggg acttttctcc ctggcttatg 540 
tgtgggagag ttattgtatg cgtacgtgta aggactcttc ttgtgtttta tatgtctttc 600 
20 tcaataatgt gatatcttga gatttagaag agaggtatcc aagtgtgcta tacctcatgt 660 
cttcaagttg gaagttgtat aacattttaa gttgcttatg agtttctcat atatgtagtg 720 
tgtttgttgg ttgaaaaatc atattataat gcaacgatac catgtttttt 
(2) INFORMATION FOR SEQ ID NO: 18 4: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

3 0 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 151 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388500 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 
35 Met Val Glu Ala Glu Val Glu Val Asp Val Glu lie Lys Ser Thr Ala 
15 10 15 

Asp Lys Phe Phe Met Phe Ser Arg Arg Ser Gin His Ala Ser Lys Ala 

20 25 30 

Thr Arg Tyr Val Gin Gly Cys Asp Leu Leu Glu Gly Glu Trp Gly Glu 
40 35 40 45 

Val Gly Ser lie Leu Leu Trp .Lys Leu Thr Val Asp Gly Glu Pro Lys 

50 55 60 

Val Ser Lys Asp Met lie Glu Ala lie Asp Met Lys Met Asn Met He 
65 70 75 80 

4 5 Gin Trp Arg Val Leu Glu Gly Pro Leu Lys Glu Glu Tyr Asn He Phe 

85 90 95 

Ser Lys Thr Met Lys Val Ser Pro Lys Gin Gly Gly Ser Gly Ser Val 

100 105 110 

Val Lys Trp Asn Leu Lys Tyr Glu Arg He Asp Glu Lys Val Ala His 
50 115 120 125 

Leu Glu Arg Leu Leu Gin Phe Phe Val Glu Cys Val Asn Glu lie Asp 

130 135 140 

Gin Tyr Leu Leu Ser Glu Gly 
145 150 
55 (2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
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(A) NAME /KEY : peptide 

(B) LOCATION: 1..131 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388501 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 
5 Met Phe Ser Arg Arg Ser Gin His Ala Ser Lys Ala Thr Arg Tyr Val 
15 10 15 

Gin Gly Cys Asp Leu Leu Glu Gly Glu Trp Gly Glu Val Gly Ser lie 

20 25 30 

Leu Leu Trp Lys Leu Thr Val Asp Gly Glu Pro Lys Val Ser Lys Asp 
10 35 40 45 

Met lie Glu Ala lie Asp Met Lys Met Asn Met lie Gin Trp Arg Val 

50 55 60 

Leu Glu Gly Pro Leu Lys Glu Glu Tyr Asn lie Phe Ser Lys Thr Met 
65 70 75 80 

15 Lys Val Ser Pro Lys Gin Gly Gly Ser Gly Ser Val Val Lys Trp Asn 

85 90 95 

Leu Lys Tyr Glu Arg lie Asp Glu Lys Val Ala His Leu Glu Arg Leu 

100 105 110 

Leu Gin Phe Phe Val Glu Cys Val Asn Glu lie Asp Gin Tyr Leu Leu 
20 115 ' 120 125 

Ser Glu Gly 
130 

(2) INFORMATION FOR SEQ ID NO: 18 6: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

3 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. .83 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388502 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 6: 
35 Met lie Glu Ala lie Asp Met Lys Met Asn Met lie Gin Trp Arg Val 
15 10 15 

Leu Glu Gly Pro Leu Lys Glu Glu Tyr Asn lie Phe Ser Lys Thr Met 

20 25 30 

Lys Val Ser Pro Lys Gin Gly Gly Ser Gly Ser Val Val Lys Trp Asn 
40 35 40 45 

Leu Lys Tyr Glu Arg lie Asp Glu Lys Val Ala His Leu Glu Arg Leu 

50 55 60 

Leu Gin Phe Phe Val Glu Cys Val Asn Glu lie Asp Gin Tyr Leu Leu 
65 70 75 80 

4 5 Ser Glu Gly 

(2) INFORMATION FOR SEQ ID NO: 187: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
55 (A) NAME /KEY : - 

(B) LOCATION: 1 . . 752 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388519 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
aatctcacgc gctttcatca tcttcttcct agattcgctc ctccactatt cgaaaaaatg 60 
60 gcgaaactgg tgatgttgtt ggttctctgt atcttaccgg cgatagccat ggcggcaagg 120 
aggggaaata ttggaaagaa tacaatggtg gttcaaggta gcacctactg tgacatttgc 180 
aaattcggct tcgagactcc tgaatcctcc tacttcatcc ccggtgcaac ggtgaagcta 240 
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tcatgcaaag acaggaagac aatggaagag gtttacacag acaaagctgt atcagacaaa 30 

gaaggaaagt ataagttcat tgtccacgac gatcacagag accagatgtg cgatgttttg 36 

cttgtgaaaa gctcggataa aacctgctct aaaatctccg ttggacgtga gaagtctcgt 42 

gtgatcttga accattacag tggcattgcc tcgcagatca gacatgctaa caacatggga 48 

tttgagaaag aagtgagtga tgtgttctgc tctgctttgt ttcagaagta tatggttgat 54 

gaagatgagg atgatattaa aaaccatctc taatctctct gtttaatctt atgatctgct 60 

gttttcttca ttaatgagtt tccagttatg gaagagatat atttgtattt gtttgattac 66 

ttatttgttg tctttagaga tgttgactct ggtgatcgga taactatctg tttgtgtaag 12 



cttcttatat attgatgtgt catttccttg cc 
(2) INFORMATION FOR SEQ ID NO: 188: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 190 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388520 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 



Asn 


Leu 


Thr 


Arg 


Phe 


His 


His 


Leu 


Leu 


Pro 


Arg 


Phe 


Ala 


Pro 


Pro 


Leu 


1 








5 










10 










15 




Phe 


Glu 


Lys 


Met 


Ala 


Lys 


Leu 


Val 


Met 


Leu 


Leu 


Val 


Leu 


Cys 


lie 


Leu 








20 










25 










30 






Pro 


Ala 


He 


Ala 


Met 


Ala 


Ala 


Arg 


Arg 


Gly 


Asn 


He 


Gly 


Lys 


Asn 


Thr 






35 










40 










45 








Met 


Val 


Val 


Gin 


Gly 


Ser 


Thr 


Tyr 


Cys 


Asp 


He 


Cys 


Lys 


Phe 


Gly 


Phe 




50 










55 










60 










Glu 


Thr 


Pro 


Glu 


Ser 


Ser 


Tyr 


Phe 


He 


Pro 


Gly 


Ala 


Thr 


Val 


Lys 


Leu 


65 










70 










75 










80 


Ser 


Cys 


Lys 


Asp 


Arg 


Lys 


Thr 


Met 


Glu 


Glu 


Val 


Tyr 


Thr 


Asp 


Lys 


Ala 










85 










90 










95 




Val 


Ser 


Asp 


Lys 


Glu 


Gly 


Lys 


Tyr 


Lys 


Phe 


He 


Val 


His 


Asp 


Asp 


His 








100 










105 










110 






Arg 


Asp 


Gin 


Met 


Cys 


Asp 


Val 


Leu 


Leu 


Val 


Lys 


Ser 


Ser 


Asp 


Lys 


Thr 






115 










120 










125 








Cys 


Ser 


Lys 


He 


Ser 


Val 


Gly 


Arg 


Glu 


Lys 


Ser 


Arg 


Val 


He 


Leu 


Asn 




130 










135 










140 










His 


Tyr 


Ser 


Gly 


He 


Ala 


Ser 


Gin 


He 


Arg 


His 


Ala 


Asn 


Asn 


Met 


Gly 


145 










150 










155 










160 


Phe 


Glu 


Lys 


Glu 


Val 


Ser 


Asp 


Val 


Phe 


Cys 


Ser 


Ala 


Leu 


Phe 


Gin 


Lys 










165 










170 










175 




Tyr 


Met 


Val 


Asp 


Glu 


Asp 


Glu 


Asp 


Asp 


He 


Lys 


Asn 


His 


Leu 







180 185 190 



(2) INFORMATION FOR SEQ ID NO: 189: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: ' 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..171 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388521 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 
Met Ala Lys Leu Val Met Leu Leu Val Leu Cys lie Leu Pro Ala He 
15 10 15 

Ala Met Ala Ala Arg Arg Gly Asn He Gly Lys Asn Thr Met Val Val 

20 25 30 

Gin Gly Ser Thr Tyr Cys Asp He Cys Lys Phe Gly Phe Glu Thr Pro 



WO 00/40695 



PCT/US00/00466 



413 







35 










40 










45 








blU 


Ser 
50 


ber 


Tyr 


Pne 


lie 


Pro 
55 


Gly 


Ala 


Thr 


val 


Lys 
60 


Leu 


Ser 


cys 


Lys 


Asp 


Arg 


Lys 


Thr 


Met 


Glu 


Glu 


TT_ 1 

Val 


Tyr 


Thr 


Asp 


Lys 


Ala 


val 


ber 


Asp 


65 










70 










75 










80 


Lys 


Glu 


Gly 


Lys 


Tyr 

o a 

o b 


Lys 


Pne 


He 


Val 


His 
y u 


Asp 


Asp 


His 


Arg 


Asp 


Gin 


Met 


Cys 


Asp 


Val 
100 


Leu 


Leu 


Val 


Lys 


Ser 
105 


Ser 


Asp 


Lys 


Thr 


Cys 
110 


ber 


Lys 


lie 


Ser 


Val 
115 


Gly 


Arg 


Glu 


Lys 


Ser 
120 


Arg 


Val 


He 


Leu 


Asn 
125 


His 


Tyr 


Ser 


Gly 


He 
130 


Ala 


Ser 


Gin 


He 


Arg 
135 


His 


Ala 


Asn 


Asn 


Met 
140 


Gly 


Phe 


Glu 


Lys 


Glu 


Val 


Ser 


Asp 


Val 


Phe 


Cys 


Ser 


Ala 


Leu 


Phe 


Gin 


Lys 


Tyr 


Met 


Val 


145 










150 










155 










160 


Asp 


Glu 


Asp 


Glu 


Asp 
165 


Asp 


He 


Lys 


Asn 


His 
170 


Leu 












(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 190: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 166 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388522 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 



Met 


Leu 


Leu 


Val 


Leu 


Cys 


He 


Leu 


Pro 


Ala 


He 


Ala 


Met 


Ala 


Ala 


Arg 


1 








5 










10 










15 




Arg 


Gly 


Asn 


He 


Gly 


Lys 


Asn 


Thr 


Met 


Val 


Val 


Gin 


Gly 


Ser 


Thr 


Tyr 








20 










25 










30 






Cys 


Asp 


He 


Cys 


Lys 


Phe 


Gly 


Phe 


Glu 


Thr 


Pro 


Glu 


Ser 


Ser 


Tyr 


Phe 






35 










40 










45 








He 


Pro 


Gly 


Ala 


Thr 


Val 


Lys 


Leu 


Ser 


Cys 


Lys 


Asp 


Arg 


Lys 


Thr 


Met 




50 










55 










60 










Glu 


Glu 


Val 


Tyr 


Thr 


Asp 


Lys 


Ala 


Val 


Ser 


Asp 


Lys 


Glu 


Gly 


Lys 


Tyr 


65 










70 










75 










80 


Lys 


Phe 


He 


Val 


His 


Asp 


Asp 


His 


Arg 


Asp 


Gin 


Met 


Cys 


Asp 


Val 


Leu 










85 










90 










95 




Leu 


Val 


Lys 


Ser 


Ser 


Asp 


Lys 


Thr 


Cys 


Ser 


Lys 


He 


Ser 


Val 


Gly Arg 








100 










105 










110 






Glu 


Lys 


Ser 


Arg 


Val 


He 


Leu 


Asn 


His 


Tyr 


Ser 


Gly 


He 


Ala 


Ser 


Gin 






115 










120 










125 








He 


Arg 


His 


Ala 


Asn 


Asn 


Met 


Gly 


Phe 


Glu 


Lys 


Glu 


Val 


Ser 


Asp 


Val 




130 










135 










140 










Phe 


Cys 


Ser 


Ala 


Leu 


Phe 


Gin 


Lys 


Tyr 


Met 


Val 


Asp 


Glu 


Asp 


Glu 


Asp 


145 










150 










155 










160 


Asp 


He 


Lys 


Asn 


His 


Leu 























165 

(2) INFORMATION FOR SEQ ID NO: 191: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..658 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388563 
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(D) OTHER INFORMATION: / Ceres Seq. ID 1388566 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
Met Arg Leu Asn Ser lie Phe Thr Phe Phe Tyr Ser Phe Thr Gly Asn 
15 10 15 

lie Val Lys Leu lie lie Cys Trp Cys Ser Val Leu Asn lie Phe Asn 

20 25 30 

Leu Val Met Asn Met Asp Gly 
35 

(2) INFORMATION FOR SEQ ID NO: 195: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 866 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388793 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
aacggtgtgg gctaatctca cgcgctttca tcatcttctt cctagattcg ctcctccact 



gtcttaaatt tcactatctt cttcttcatt ttccttcctt ctcacttcaa atccataatc 1 

gctctacttc tctctctctc tagattcgaa aaaatggcga aactggtgat gttgttggtt 1 

ctctgtatct taccggcgat agccatggcg gcaaggaggg gaaatattgg aaagaataca 2 

atggtggttc aaggtagcac ctactgtgac atttgcaaat tcggcttcga gactcctgaa 3 

tcctcctact tcatccccgg tgcaacggtg aagctatcat gcaaagacag gaagacaatg 3 

gaagaggttt acacagacaa agctgtatca gacaaagaag gaaagtataa gttcattgtc 4 

cacgacgatc acagagacca gatgtgcgat gttttgcttg tgaaaagctc ggataaaacc 4 

tgctctaaaa tctccgttgg acgtgagaag tctcgtgtga tcttgaacca ttacagtggc 5 

attgcctcgc agatcagaca tgctaacaac atgggatttg agaaagaagt gagtgatgtg 6 

ttctgctctg ctttgtttca gaagtatatg gttgatgaag atgaggatga tattaaaaac 6 

catctctaat ctctctgttt aatcttatga tctgctgttt tcttcattaa tgagtttcga 7 

gttatggaag agatatattt gtatttgttt gattacttat ttgttgtctt tagagatgtt 7 

gactctggtg atcggataac tatctgtttg tgtaagcttc ttatatattg atatgtcatt 8 



tccttgctta ggaagtgtct tttatt 
(2) INFORMATION FOR SEQ ID NO: 196: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 171 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388794 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 



Met 


Ala 


Lys 


Leu 


Val 


Met 


Leu 


Leu 


Val 


Leu 


Cys 


He 


Leu 


Pro 


Ala 


He 


1 








5 










10 










15 




Ala 


Met 


Ala 


Ala 


Arg 


Arg 


Gly 


Asn 


He 


Gly 


Lys 


Asn 


Thr 


Met 


Val 


Val 








20 










25 










30 






Gin 


Gly 


Ser 


Thr 


Tyr 


Cys 


Asp 


He 


Cys 


Lys 


Phe 


Gly 


Phe 


Glu 


Thr 


Pro 






35 










40 










45 








Glu 


Ser 


Ser 


Tyr 


Phe 


He 


Pro 


Gly 


Ala 


Thr 


Val 


Lys 


Leu 


Ser 


Cys 


Lys 




50 










55 










60 










Asp 


Arg 


Lys 


Thr 


Met 


Glu 


Glu 


Val 


Tyr 


Thr 


Asp 


Lys 


Ala 


Val 


Ser 


Asp 


65 










70 










75 










80 


Lys 


Glu 


Gly 


Lys 


Tyr 


Lys 


Phe 


He 


Val 


His 


Asp 


Asp 


His 


Arg 


Asp 


Gin 










85 










90 










95 




Met 


Cys 


Asp 


Val 


Leu 


Leu 


Val 


Lys 


Ser 


Ser 


Asp 


Lys 


Thr 


Cys 


Ser 


Lys 








100 










105 










110 






He 


Ser 


Val 


Gly 


Arg 


Glu 


Lys 


Ser 


Arg 


Val 


He 


Leu 


Asn 


His 


Tyr 


Ser 
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115 120 125 

Gly lie Ala Ser Gin lie Arg His Ala Asn Asn Met Gly Phe Glu Lys 

130 135 140 

Glu Val Ser Asp Val Phe Cys Ser Ala Leu Phe Gin Lys Tyr Met Val 
5 145 150 155 160 

Asp Glu Asp Glu Asp Asp lie Lys Asn His Leu 

165 170 
(2) INFORMATION FOR SEQ ID NO: 197: 
(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

15 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. . 166 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388795 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



20 


Met 
1 


Leu 


Leu 


Val 


Leu 
5 


Cys 


He 


Leu 


Pro 


Ala 
10 


He 


Ala 


Met 


Ala 


Ala Arg 
15 




Arg 


Gly 


Asn 


He 

20 


Gly 


Lys 


Asn 


Thr 


Met 
25 


Val 


Val 


Gin 


Gly 


Ser 
30 


Thr Tyr 




Cys 


Asp 


He 


Cys 


Lys 


Phe 


Gly 


Phe 


Glu 


Thr 


Pro 


Glu 


Ser 


Ser 


Tyr Phe 


25 






35 










40 










45 








He 


Pro 
50 


Gly 


Ala 


Thr 


Val 


Lys 
55 


Leu 


Ser 


Cys 


Lys 


Asp 
60 


Arg 


Lys 


Thr Met 




Glu 


Glu 


Val 


Tyr 


Thr 


Asp 


Lys 


Ala 


Val 


Ser 


Asp 


Lys 


Glu 


Gly 


Lys Tyr 




65 










70 










75 








80 


30 


Lys 


Phe 


He 


Val 


His 
85 


Asp 


Asp 


His 


Arg 


Asp 
90 


Gin 


Met 


Cys 


Asp 


Val Leu 
95 




Leu 


Val 


Lys 


Ser 
100 


Ser 


Asp 


Lys 


Thr 


Cys 
105 


Ser 


Lys 


He 


Ser 


Val 
110 


Gly Arg 




Glu 


Lys 


Ser 


Arg 


Val 


He 


Leu 


Asn 


His 


Tyr 


Ser 


Gly 


He 


Ala 


Ser Gin 


35 






115 










120 










125 








He 


Arg 
130 


His 


Ala 


Asn 


Asn 


Met 
135 


Gly 


Phe 


Glu 


Lys 


Glu 
140 


Val 


Ser 


Asp Val 




Phe 


Cys 


Ser 


Ala 


Leu 


Phe 


Gin 


Lys 


Tyr 


Met 


Val 


Asp 


Glu 


Asp 


Glu Asp 




145 










150 










155 








160 


40 


Asp 


He 


Lys 


Asn 


His 
165 


Leu 






















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 198: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 
45 (B) TYPE: amino acid 



(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

50 (A) NAME/KEY: peptide 

(B) LOCATION: 1..154 

(D) OTHER INFORMATION: / Ceres Seq. ID 1388796 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 
Met Ala Ala Arg Arg Gly Asn lie Gly Lys Asn Thr Met Val Val Gin 
55 1 5 10 15 

Gly Ser Thr Tyr Cys Asp He Cys Lys Phe Gly Phe Glu Thr Pro Glu 

20 25 30 

Ser Ser Tyr Phe He Pro Gly Ala Thr Val Lys Leu Ser Cys Lys Asp 
35 40 45 

60 Arg Lys Thr Met Glu Glu Val Tyr Thr Asp Lys Ala Val Ser Asp Lys 
50 55 60 

Glu Gly Lys Tyr Lys Phe He Val His Asp Asp His Arg Asp Gin Met 
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55 



60 



D D 








7 n 










1 R 










o u 


tys 


Asp 


vai Lieu 


Leu 


vai 


Lys 


Ser 


Ser 


Asp 


Lys 


T H -r- 

i nr 


Cys 


Ser 


Lys 


±_i_e 








o 3 




















QR 




Ser 


v 3.1. 


Li-Ly Arg 


C 1 n 


Lys 


Ser 


Arg 


v a J. 


l _i_ e 


Leu 


Asn 


nlS 


Tyr 


Ser 


(j_i_y 






100 










105 










11U 






He 


Ala 


Ser Gin 


He 


Arg 


His 


Ala 


Asn 


Asn 


Met 


Gly 


Phe 


Glu 


Lys 


Glu 






115 








120 










125 








Val 


Ser 


Asp Val 


Phe 


Cys 


Ser 


Ala 


Leu 


Phe 


Gin 


Lys 


Tyr 


Met 


Val 


Asp 




130 








135 










140 










Glu 


Asp 


Glu Asp 


Asp 


He 


Lys 


Asn 


His 


Leu 














145 








150 






















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 199: 

















10 

145 

(2) I 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 
15 <B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
2 0 (A) NAME/ KEY : - 

(B) LOCATION: 1..533 

(D) OTHER INFORMATION: / Ceres Seq. ID 1392041 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
aaccataaca caaatacacc aaaaaaaaaa aaacagaaag taacatgaga tctctcttat 60 
25 tagccgtgtg cctggttctt gctttacact gcggtgaagc agccgtgtct tgcaacacgg 120 
tgattgcgga tctttaccct tgcttatcct acgtgactca gggcggaccg gtcccaaccc 180 
tctgctgcaa cggtctcaca acactcaaga gtcaggctca aacttctgtg gaccgtcagg 240 
gggtctgtcg ttgcatcaaa tctgctattg gaggactcac tctctctcct agaaccatcc 300 
aaaatgcttt ggaattgcct tctaaatgtg gtgtcgatct cccttacaag ttcagccctt 360 
30 ccactgactg cgacagtatc cagtgagaca agcagaaaat cttaaaggaa gctactacaa 420 
gaactataat aacctaataa ttaataaatg agggcattgg tttgctagtt gctaattgat 480 
cagtgatgta ttgtcatttt gaatgttcta atatcagcag gcacttatct ctg 
(2) INFORMATION FOR SEQ ID NO: 200: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 113 

(D) OTHER INFORMATION: / Ceres Seq. ID 1392042 

(xi) S 

45 



50 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 200 












Met 


Arg 


Ser 


Leu 


Leu 


Leu 


Ala 


Val 


Cys 


Leu 


Val 


Leu 


Ala 


Leu 


His 


Cys 


1 








5 










10 










15 




Gly 


Glu 


Ala 


Ala 


Val 


Ser 


Cys 


Asn 


Thr 


Val 


He 


Ala 


Asp 


Leu 


Tyr 


Pro 








20 










25 










30 






Cys 


Leu 


Ser 


Tyr 


Val 


Thr 


Gin 


Gly 


Gly 


Pro 


Val 


Pro 


Thr 


Leu 


Cys 


Cys 






35 










40 










45 








Asn 


Gly 


Leu 


Thr 


Thr 


Leu 


Lys 


Ser 


Gin 


Ala 


Gin 


Thr 


Ser 


Val 


Asp 


Arg 




50 










55 










60 










Gin 


Gly 


Val 


Cys 


Arg 


Cys 


He 


Lys 


Ser 


Ala 


lie 


Gly 


Gly 


Leu 


Thr 


Leu 


65 










70 










75 










80 


Ser 


Pro 


Arg 


Thr 


He 


Gin 


Asn 


Ala 


Leu 


Glu 


Leu 


Pro 


Ser 


Lys 


Cys 


Gly 










85 










90 










95 




Val 


Asp 


Leu 


Pro 


Tyr 


Lys 


Phe 


Ser 


Pro 


Ser 


Thr 


Asp 


Cys 


Asp 


Ser 


He 



100 105 110 

Gin 

(2) INFORMATION FOR SEQ ID NO: 201: 
(i) SEQUENCE CHARACTERISTICS: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



418 

(A) LENGTH: 1028 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..1028 

(D) OTHER INFORMATION: / Ceres Seq. ID 1393342 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 
aatatgtcct taattgctct tatcggagaa ggcttactct gtttttcagg cgaactctgt 
ttatcttctc cgagctcgta cgagaatatt cgtaaaggtt tcttgtgagc ttaaacgatg 
agtacccttg agaccacaag agctgagctt ggtctggtag ttgtgtattt gaataaagcg 
gaggcgagag acaagatttg tcgagctatt caatatggtt ccaagttctt gagtgatggc 
caacctggca ctgctcaaaa tgtcgacaag aacaccagct tggctaggaa agttttccgt 
ctcttcaagt ttgtaaatga tcttcatgct ctcattagcc ctgttcccaa agggactcca 
ctcccgcttg ttctgctcgg aaagtctaaa aacgcgttgc tgtcaacgtt cttgttccta 
gatcaaattg tgtggcttgg caggactggg atttacaagg acaaagaacg tgctgagatt 
cttggacgta tatcactttt ctgttggatg ggttcttcgg tttgcacatc cttggttgag 
gttggggagc ttggtaggct gtcagcatca attaagaagt tagaaaaaga gatagggaac 
aaggataaac accagaatga gcaataccgt gcaaaagtag agaaatcaaa cgagaggtca 
ttggctctga tcaaagcagg gatggatgta gttgttgctt tcggattgct tcaattggct 
ccaaagaaag tcactccccg agtcacaggt gctttcggat tcgcctcatc acttatctct 
tgttatcagc tattgccgtc acatcccaag tccaagatgg tctgataaaa gggaaacaag 
gaaaggcgaa ctagtttcat tgtccacatg ttccatatat agtttgtgtc tgttttacta 
ttacttggtg ttaaatttgt cttgcttgac tactttaatg tttgtgttag attctatggt 
atataagata atgataaaca ttgcagtctc tgtagaataa taagaacaga taagcattac 
gaggtgtc 

(2) INFORMATION FOR SEQ ID NO: 202: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



peptide 
1. .235 

(D) OTHER INFORMATION: / Ceres Seq. ID 1393343 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
Met Ser Thr Leu Glu Thr Thr Arg Ala Glu Leu Gly Leu Val Val Val 
15 10 15 

Tyr Leu Asn Lys Ala Glu Ala Arg Asp Lys lie Cys Arg Ala lie Gin 

20 25 30 

Tyr Gly Ser Lys Phe Leu Ser Asp Gly Gin Pro Gly Thr Ala Gin Asn 

35 4 0 4 5 

Val Asp Lys Asn Thr Ser Leu Ala Arg Lys Val Phe Arg Leu Phe Lys 

50 55 60 

Phe Val Asn Asp Leu His Ala Leu lie Ser Pro Val Pro Lys Gly Thr 
65 70 75 80 

Pro Leu Pro Leu Val Leu Leu Gly Lys Ser Lys Asn Ala Leu Leu Ser 

85 90 95 

Thr Phe Leu Phe Leu Asp Gin lie Val Trp Leu Gly Arg Thr Gly lie 

100 105 110 

Tyr Lys Asp Lys Glu Arg Ala Glu lie Leu Gly Arg lie Ser Leu Phe 

115 120 125 

Cys Trp Met Gly Ser Ser Val Cys Thr Ser Leu Val Glu Val Gly Glu 

130 135 140 

Leu Gly Arg Leu Ser Ala Ser lie Lys Lys Leu Glu Lys Glu lie Gly 
145 150 '* 155 160 

Asn Lys Asp Lys His Gin Asn Glu Gin Tyr Arg Ala Lys Val Glu Lys 
165 170 175 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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Ser Asn Glu Arg Ser Leu Ala Leu lie Lys Ala Gly Met Asp Val Val 

180 185 " ' 190 

Val Ala Phe Gly Leu Leu Gin Leu Ala Pro Lys Lys Val Thr Pro Arg 

195 " 200 205 

Val Thr Gly Ala Phe Gly Phe Ala Ser Ser Leu lie Ser Cys Tyr Gin 

210 " 215 220 

Leu Leu Pro Ser His Pro Lys Ser Lys Met Val 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 203: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 653 

<D) OTHER INFORMATION: / Ceres Seq. ID 1393556 
SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
tcttcccaat cccaacagct tcatcgtttt atcttccaat caaagcagct caaatcaagc 
tcgcaaagat gatgcacatg accttctact ggggaatcaa agccacaatt ctcttcgatt 
tctggaaaac tgactcatgg cttagttaca tcctcacttt aatcgcttgc ttcgtcttct 
ccgctttcta tcaatacctc gagaatcgcc gcatccaatt caaatccctt tcttcctccc 
gtcgtgctcc tccaccgcct cgctcttcct ccggcgtctc cgcgcctctt atccctaaat 
ccggtaccag atccgccgct aaagctgctt cggttcttct tttcggcgtc aacgcagcga 
tcggttactt gctgatgctt gcagctatgt ctttcaacgg aggtgttttc atcgcgattg 
tcgtcggatt aaccgccgga tacgctgttt ttagatctga tgacggcggt gctgataccg 
ccacggatga tccatgtcca tgtgcttgat aatgattgat aataaaatca aaatcggaaa 
aatctcaaaa aaggtgtgtt tgataattga tgtttgtatt gttgtttgtt tctgtatgat 
tagttgcttt gtaacaaaat aatccaatct aaatgtgttt cttgagtata tgc 
(2) INFORMATION FOR SEQ ID NO: 204: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. .168 

(D) OTHER INFORMATION: / Ceres Seq. ID 1393557 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 
Phe Pro lie Pro Thr Ala Ser Ser Phe Tyr Leu Pro lie Lys Ala Ala 
15 10 15 

Gin lie Lys Leu Ala Lys Met Met His Met Thr Phe Tyr Trp Gly lie 

20 25 30 

Lys Ala Thr lie Leu Phe Asp Phe Trp Lys Thr Asp Ser Trp Leu Ser 

35 4 0 " 4 5 

Tyr lie Leu Thr Leu He Ala Cys Phe Val Phe Ser Ala Phe Tyr Gin 

50 55 60 

Tyr Leu Glu Asn Arg Arg He Gin Phe Lys Ser Leu Ser Ser Ser Arg 
65 70 ~ 75 80 

Arg Ala Pro Pro Pro Pro Arg Ser Ser Ser Gly Val Ser Ala Pro Leu 

85 90 95 

He Pro Lys Ser Gly Thr Arg Ser Ala Ala Lys Ala Ala Ser Val Leu 

100 105 110 

Leu Phe Gly Val Asn Ala Ala He Gly Tyr Leu Leu Met Leu Ala Ala 

115 120 125 

Met Ser Phe Asn Gly Gly Val Phe He Ala He Val Val Gly Leu Thr 

130 135 140 

Ala Gly Tyr Ala Val Phe Arg Ser Asp Asp Gly Gly Ala Asp Thr Ala 



6 
12 
18 
24 
30 
36 
42 
48 
54 
60 
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145 150 155 160 

Thr Asp Asp Pro Cys Pro Cys Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 2 05: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
10 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 14 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1393558 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 



Met 


Met 


His 


Met 


Thr 


Phe 


Tyr 


Trp 


Gly 


He 


Lys 


Ala 


Thr 


He 


Leu 


Phe 


1 








5 










10 










15 




Asp 


Phe 


Trp 


Lys 


Thr 


Asp 


Ser 


Trp 


Leu 


Ser 


Tyr 


He 


Leu 


Thr 


Leu 


He 








20 










25 










30 






Ala 


Cys 


Phe 


Val 


Phe 


Ser 


Ala 


Phe 


Tyr 


Gin 


Tyr 


Leu 


Glu 


Asn 


Arg 


Arg 






35 










40 










45 








He 


Gin 


Phe 


Lys 


Ser 


Leu 


Ser 


Ser 


Ser 


Arg 


Arg 


Ala 


Pro 


Pro 


Pro 


Pro 




50 










55 










60 










Arg 


Ser 


Ser 


Ser 


Gly 


Val 


Ser 


Ala 


Pro 


Leu 


He 


Pro 


Lys 


Ser 


Gly 


Thr 


65 










70 










75 










80 


Arg 


Ser 


Ala 


Ala 


Lys 


Ala 


Ala 


Ser 


Val 


Leu 


Leu 


Phe 


Gly 


Val 


Asn 


Ala 










85 










90 










95 




Ala 


He 


Gly 


Tyr 


Leu 


Leu 


Met 


Leu 


Ala 


Ala 


Met 


Ser 


Phe 


Asn 


Gly 


Gly 








100 










105 










110 






Val 


Phe 


He 


Ala 


He 


Val 


Val 


Gly 


Leu 


Thr 


Ala 


Gly 


Tyr 


Ala 


Val 


Phe 






115 










120 










125 








Arg 


Ser 


Asp 


Asp 


Gly 


Gly 


Ala 


Asp 


Thr 


Ala 


Thr 


Asp 


Asp 


Pro 


Cys 


Pro 




130 










135 










140 











Cys Ala 
35 145 

(2) INFORMATION FOR SEQ ID NO: 20 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
4 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
45 (B) LOCATION: 1..145 

(D) OTHER INFORMATION: / Ceres Seq. ID 1393559 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 





Met 


His 


Met 


Thr 


Phe 


Tyr 


Trp 


Gly 


He 


Lys 


Ala 


Thr 


He 


Leu 


Phe 


Asp 




1 








5 










10 










15 




50 


Phe 


Trp 


Lys 


Thr 
20 


Asp 


Ser 


Trp 


Leu 


Ser 
25 


Tyr 


He 


Leu 


Thr 


Leu 

30 


He 


Ala 




Cys 


Phe 


Val 
35 


Phe 


Ser 


Ala 


Phe 


Tyr 
40 


Gin 


Tyr 


Leu 


Glu 


Asn 
45 


Arg 


Arg 


He 




Gin 


Phe 


Lys 


Ser 


Leu 


Ser 


Ser 


Ser 


Arg 


Arg 


Ala 


Pro 


Pro 


Pro 


Pro 


Arg 


55 




50 










55 










60 












Ser 


Ser 


Ser 


Gly 


Val 


Ser 


Ala 


Pro 


Leu 


lie 


Pro 


Lys 


Ser 


Gly 


Thr 


Arg 




65 










70 










75 










80 




Ser 


Ala 


Ala 


Lys 


Ala 
85 


Ala 


Ser 


Val 


Leu 


Leu 
90 


Phe 


Gly 


Val 


Asn 


Ala 
95 


Ala 


60 


He 


Gly 


Tyr 


Leu 
100 


Leu 


Met 


Leu 


Ala 


Ala 
105 


Met 


Ser 


Phe 


Asn 


Gly 
110 


Gly 


Val 




Phe 


He 


Ala 


He 


Val 


Val 


Gly 


Leu 


Thr 


Ala 


Gly 


Tyr 


Ala 


Val 


Phe 


Arg 
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115 120 125 

Ser Asp Asp Gly Gly Ala Asp Thr Ala Thr Asp Asp Pro Cys Pro Cys 
130 135 140 

Ala 
145 

(2) INFORMATION FOR SEQ ID NO: 207: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..525 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396782 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



atgattttta cagacgagag gagacggcga caatggcgaa tttgatgatg agattaccaa 60 

ttagcttgag aagcttctct gtttcagctt cttcatccaa cggttcgccg ccggtgatcg 120 

gaggatctag cggcggtgta ggaccgatga ttgtggaatt accgttggag aagatacgaa 180 

gaccgttgat gcgaaccaga tccaacgatc agaacaaagt gaaagagctt atggatagta 240 

tccgtcaaat cggtcttcaa gttccgattg atgtgattga agttgatgga acttactatg 300 

ggttctcggg atgtcacaga tacgaggcgc atcagaagct aggtcttcca acaatacgtt 360 

gcaaaatacg taaaggaaca aaggaaacat taaggcatca tcttcgctga agagagtttt 420 

acttttatgt atattgtttg tatgatagga actcgagtat gtgtatgaaa gttaagaata 480 



agaactttgt acaactttgt gatagattta gtgactggat tctgc 
(2) INFORMATION FOR SEQ ID NO: 208: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..135 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396783 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 



Asp 


Phe 


Tyr 


Arg 


Arg 


Glu 


Glu 


Thr 


Ala 


Thr 


Met 


Ala 


Asn 


Leu 


Met 


Met 


1 








5 










10 










15 




Arg 


Leu 


Pro 


He 


Ser 


Leu 


Arg 


Ser 


Phe 


Ser 


Val 


Ser 


Ala 


Ser 


Ser 


Ser 








20 










25 










30 






Asn 


Gly 


Ser 


Pro 


Pro 


Val 


He 


Gly 


Gly 


Ser 


Ser 


Gly 


Gly 


Val 


Gly 


Pro 






35 










40 










45 








Met 


He 


Val 


Glu 


Leu 


Pro 


Leu 


Glu 


Lys 


He 


Arg 


Arg 


Pro 


Leu 


Met 


Arg 




50 










55 










60 










Thr 


Arg 


Ser 


Asn 


Asp 


Gin 


Asn 


Lys 


Val 


Lys 


Glu 


Leu 


Met 


Asp 


Ser 


He 


65 










70 










75 










80 


Arg 


Gin 


He 


Gly 


Leu 


Gin 


Val 


Pro 


He 


Asp 


Val 


He 


Glu 


Val 


Asp 


Gly 










85 










90 










95 




Thr 


Tyr 


Tyr 


Gly 


Phe 


Ser 


Gly 


Cys 


His 


Arg 


Tyr 


Glu 


Ala 


His 


Gin 


Lys 








100 










105 










110 






Leu 


Gly 


Leu 


Pro 


Thr 


He 


Arg 


Cys 


Lys 


He 


Arg 


Lys 


Gly 


Thr 


Lys 


Glu 






115 










120 










125 








Thr 


Leu 


Arg 


His 


His 


Leu 


Arg 






















130 










135 




















(2) 


INFORMATION 


FOR 


SEQ 


id : 


NO : 2 0 9 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



WO 00/40695 



PCT/US00/00466 



422 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..125 

(D) OTHER INFORMATION : ■ / Ceres Seq. ID 1396784 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 
Met Ala Asn Leu Met Met Arg Leu Pro lie Ser Leu Arg Ser 
15 10 

Val Ser Ala Ser Ser Ser Asn Gly Ser Pro Pro Val lie Gly 

20 25 30 

Ser Gly Gly Val Gly Pro Met lie Val Glu Leu Pro Leu Glu 

^35 "* 4 0 4 5 

Arg Arg Pro Leu Met Arg Thr Arg Ser Asn Asp Gin Asn Lys 

50 55 60 

Glu Leu Met Asp Ser lie Arg Gin lie Gly Leu Gin Val Pro 
65 70 " 75 

Val lie Glu Val Asp Gly Thr Tyr Tyr Gly Phe Ser Gly Cys 

85 "* 90 
Tyr Glu Ala His Gin Lys Leu Gly Leu Pro Thr lie Arg Cys 
100 ' 105 110 

Arg Lys Gly Thr Lys Glu Thr Leu Arg His His Leu Arg 
115 ' 120 125 

(2) INFORMATION FOR SEQ ID NO:210: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..121 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396785 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:210: 



Met 


Met 


Arg 


Leu 


Pro 


He 


Ser 


Leu 


Arg 


Ser 


Phe 


Ser 


Val 


Ser 


Ala 


Ser 


1 








5 










10 










15 




Ser 


Ser 


Asn 


Gly 
20 


Ser 


Pro 


Pro 


Val 


He 
25 


Gly 


Gly 


Ser 


Ser 


Gly 
30 


Gly 


Val 


Gly 


Pro 


Met 

35 


He 


Val 


Glu 


Leu 


Pro 
40 


Leu 


Glu 


Lys 


He 


Arg 
45 


Arg 


Pro 


Leu 


Met 


Arg 
50 


Thr 


Arg 


Ser 


Asn 


Asp 
55 


Gin 


Asn 


Lys 


Val 


Lys 
60 


Glu 


Leu 


Met 


Asp 


Ser 


He 


Arg 


Gin 


He 


Gly 


Leu 


Gin 


Val 


Pro 


He 


Asp 


Val 


He 


Glu 


Val 


65 










70 










75 










80 


Asp 


Gly 


Thr 


Tyr 


Tyr 
85 


Gly 


Phe 


Ser 


Gly 


Cys 
90 


His 


Arg 


Tyr 


Glu 


Ala 
95 


His 


Gin 


Lys 


Leu 


Gly 
100 


Leu 


Pro 


Thr 


He 


Arg 
105 


Cys 


Lys 


He 


Arg 


Lys 
110 


Gly 


Thr 


Lys 


Glu 


Thr 
115 


Leu 


Arg 


His 


His 


Leu 
120 


Arg 
















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO:211: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 939 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396802 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211: 



Phe Ser 
15 

Gly Ser 

Lys He 

Val Lys 

He Asp 
80 

His Arg 
95 

Lys He 
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ataacaaacc agggtcaggg ttttatcttc catattcttt tcccactgat tcttacaatc 60 
gatctaaggg aatctacgaa ctgtttggta atttctggtg ggaaaagtct tgatcttgct 120 
gtttggtttg gagtaatggt gagtgcttgg tttgttgttg gctcaccggt gatggactct 180 
tcgagcagtc catgtttatg tctggatgct cacacgacgg gtaccatcag gaggaagaag 240 
atccttggca aagcaagaaa tctggaatta ggaagctctt tcactggctc acgcattgtt 300 
tttaggcttt ctccgaagcg tgtatcgaga attgcgaatc gaaagagcaa aaagctcttg 360 
attgtgaatg aggatgttgc tggtaattac gacgatacct tcggtgatgt acaaaagcaa 420 
attgttaatt attttacata caaagctgtg aggacggttc ttcatcagct gtacgaaatg 480 
aatcctcctc aatacacttg gttctataat catatcataa caaacagacc gacggatggc 540 
aaacgtttcc tccgtgccct cggcaaggag agtcaggagc ttgcagaaag agtgatgatc 600 
acgcgtcttc acttgtatgg caaatggatc aagaaatgcg accatgggaa gatataccag 660 
gaaatatcgg atgaaaactt ggcgttgatg cgtgaacgcc tgatggagac cgtgatatgg 720 
ccttccgatg acacaaactc aaggtaatag gctgaagaag acagcgcaca ccacagcaca 780 
acatttgctt tgtcaatttg cttctcagtt tttaggactt cgagtttttt agtttgatta 840 
catattacta tagtgaaaga gcagtgtgta tatgatgtct attcttggca tgtactatat 900 
gatattaggg gcaaaaaaca acacccttct tgtgctcct 
(2) INFORMATION FOR SEQ ID NO: 212: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



peptide 
1. .248 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396803 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 
lie Thr Asn Gin Gly Gin Gly Phe lie Phe His lie Leu Phe Pro Leu 
15 10 15 

lie Leu Thr lie Asp Leu Arg Glu Ser Thr Asn Cys Leu Val lie Ser 

20 " 25 30 

Gly Gly Lys Ser Leu Asp Leu Ala Val Trp Phe Gly Val Met Val Ser 

35 40 45 

Ala Trp Phe Val Val Gly Ser Pro Val Met Asp Ser Ser Ser Ser Pro 

50 " 55 60 

Cys Leu Cys Leu Asp Ala His Thr Thr Gly Thr lie Arg Arg Lys Lys 
65 70 75 80 

lie Leu Gly Lys Ala Arg Asn Leu Glu Leu Gly Ser Ser Phe Thr Gly 

85 90 95 

Ser Arg lie Val Phe Arg Leu Ser Pro Lys Arg Val Ser Arg lie Ala 

100 105 110 

Asn Arg Lys Ser Lys Lys Leu Leu lie Val Asn Glu Asp Val Ala Gly 

115 " "* 120 125 

Asn Tyr Asp Asp Thr Phe Gly Asp Val Gin Lys Gin lie Val Asn Tyr 

130 135 140 

Phe Thr Tyr Lys Ala Val Arg Thr Val Leu His Gin Leu Tyr Glu Met 
145 150 155 160 

Asn Pro Pro Gin Tyr Thr Trp Phe Tyr Asn His lie lie Thr Asn Arg 

165 ~ 170 175 

Pro Thr Asp Gly Lys Arg Phe Leu Arg Ala Leu Gly Lys Glu Ser Gin 

180 " 185 190 

Glu Leu Ala Glu Arg Val Met lie Thr Arg Leu His Leu Tyr Gly Lys 

195 200 205 

Trp lie Lys Lys Cys Asp His Gly Lys lie Tyr Gin Glu lie Ser Asp 

210 215 220 

Glu Asn Leu Ala Leu Met Arg Glu Arg Leu Met Glu Thr Val lie Trp 
225 230 235 240 

Pro Ser Asp Asp Thr Asn Ser Arg 
245 

(2) INFORMATION FOR SEQ ID NO: 213: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..203 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396804 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

Met Val Ser Ala Trp Phe Val Val Gly Ser Pro Val Met Asp Ser Ser 
15 10 15 

Ser Ser Pro Cys Leu Cys Leu Asp Ala His Thr Thr Gly Thr lie Arg 
20 ~ 25 30 

15 Arg Lys Lys lie Leu Gly Lys Ala Arg Asn Leu Glu Leu Gly Ser Ser 
35 ~ ~ 40 45 

Phe Thr Gly Ser Arg He Val Phe Arg Leu Ser Pro Lys Arg Val Ser 

50 55 60 

Arg He Ala Asn Arg Lys Ser Lys Lys Leu Leu He Val Asn Glu Asp 
20 65 70 75 80 

Val Ala Gly Asn Tyr Asp Asp Thr Phe Gly Asp Val Gin Lys Gin He 

85 90 95 

Val Asn Tyr Phe Thr Tyr Lys Ala Val Arg Thr Val Leu His Gin Leu 
100 105 110 

25 Tyr Glu Met Asn Pro Pro Gin Tyr Thr Trp Phe Tyr Asn His He He 
115 120 125 

Thr Asn Arg Pro Thr Asp Gly Lys Arg Phe Leu Arg Ala Leu Gly Lys 

130 135 140 

Glu Ser Gin Glu Leu Ala Glu Arg Val Met He Thr Arg Leu His Leu 
30 145 150 155 160 

Tyr Gly Lys Trp He Lys Lys Cys Asp His Gly Lys He Tyr Gin Glu 

165 170 175 

He Ser Asp Glu Asn Leu Ala Leu Met Arg Glu Arg Leu Met Glu Thr 
180 185 190 

35 Val He Trp Pro Ser Asp Asp Thr Asn Ser Arg 
195 200 
(2) INFORMATION FOR SEQ ID NO: 214: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 
4 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1. .191 

(D) OTHER INFORMATION: / Ceres Seq. ID 1396805 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 
Met Asp Ser Ser Ser Ser Pro Cys Leu Cys Leu Asp Ala His Thr Thr 
50 1 5 10 15 

Gly Thr He Arg Arg Lys Lys He Leu Gly Lys Ala Arg Asn Leu Glu 

20 25 30 

Leu Gly Ser Ser Phe Thr Gly Ser Arg He Val Phe Arg Leu Ser Pro 
35 40 45 

55 Lys Arg Val Ser Arg He Ala Asn Arg Lys Ser Lys Lys Leu Leu He 
50 55 ^60 

Val Asn Glu Asp Val Ala Gly Asn Tyr Asp Asp Thr Phe Gly Asp Val 
65 70 75 80 

Gin Lys Gin He Val Asn Tyr Phe Thr Tyr Lys Ala Val Arg Thr Val 
60 85 90 95 

Leu His Gin Leu Tyr Glu Met Asn Pro Pro Gin Tyr Thr Trp Phe Tyr 
100 105 110 
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Asn 


His 


He 
115 


He 


Thr 


Ben 


Arg 


Pro 
120 


Thr 


Asp 


Gly 


Lys 


Arg 
125 


Phe 


Leu 


Arg 


Ala 


130 


Gly 


Ly s 


Glu 




Gin 
135 


Glu 




Ala 


Glu 


Arg 
140 


Val 


Met 


He 


Thr 


Arg 


Leu 


His 


Leu 


Tyr 


Gly 


Lys 


Trp 


He 


Lys 


Lys 


Cys 


Asp 


His 


Gly 


Lys 


145 










150 










155 










160 


He 


Tyr 


Gin 


Glu 


He 
165 


Ser 


Asp 


Glu 


Asn 


Leu 
170 


Ala 


Leu 


Met 


Arg 


Glu 
175 


Arg 


Leu 


Met 


Glu 


Thr 
180 


Val 


He 


Trp 


Pro 


Ser 
185 


Asp 


Asp 


Thr 


Asn 


Ser 
190 


Arg 





(2) INFORMATION FOR SEQ ID NO:215: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..7 53 

(D) OTHER INFORMATION: / Ceres Seq. ID 1397130 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 



aaagatttgg agaaaaagtt tactaaaaaa gaatctgggg aagtaagtga gatgaaggaa 60 

gaggaggaga teggaaaace agegaaacca aaagecaaga aagatgttgc tcctgggagg 120 

ttgattgata cctatgctgc acagtgegat aactgtcaca agtggagggt gattgatagc 180 

caggaggaat atgaagatat cagaagtaaa atgetcgagg atccttttaa ctgtcagaag 240 

aaacagggca tgtcttgtga agagectget gatattgact acgattcttc teggacttgg 300 

gtcattgaca agectggtet ccccaaaacg cctaaaggtt tcaagagaag cttagttctc 360 

agaaaagatt actctaagat ggatacctac tactttactc etacegggaa gaagctcagg 420 

agtcgcaatg aaatcgctgc cttcgttgaa gccaatccgg aattcaggaa cgcaccactt 480 

ggagacttca atttcactgt ccccaaggtc atggaagata ctgttccccc tgatccgaag 540 

cttggctctc cttttccaag cactactacc actacttcag agaagagcag tgtcaagcag 600 

agecataact aatatgttct cttctccggg cctttttcta ctttctttct gctccctaaa 660 

geaatgegtt ttgagtcttt atagatttga tgttctctaa aactgttatg atattaacat 720 



ctctaagcaa aaaaatatga tcttttgtct etc 
(2) INFORMATION FOR SEQ ID NO : 2 1 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 



(A) NAME /KEY : peptide 

(B) LOCATION: 1..203 

(D) OTHER INFORMATION: / Ceres Seq. ID 1397131 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 216 


> : 










Lys 


Asp 


Leu 


Glu 


Lys 


Lys 


Phe 


Thr 


Lys 


Lys 


Glu 


Ser 


Gly 


Glu 


Val 


Ser 


1 








5 










10 










15 




Glu 


Met 


Lys 


Glu 


Glu 


Glu 


Glu 


He 


Gly 


Lys 


Pro 


Ala 


Lys 


Pro 


Lys 


Ala 








20 










25 










30 






Lys 


Lys 


Asp 


Val 


Ala 


Pro 


Gly 


Arg 


Leu 


He 


Asp 


Thr 


Tyr 


Ala 


Ala 


Gin 






35 










40 










45 








Cys 


Asp 


Asn 


Cys 


His 


Lys 


Trp 


Arg 


Val 


He 


Asp 


Ser 


Gin 


Glu 


Glu 


Tyr 




50 










55 










60 










Glu 


Asp 


He 


Arg 


Ser 


Lys 


Met 


Leu 


Glu 


Asp 


Pro 


Phe 


Asn 


Cys 


Gin 


Lys 


65 










70 










75 










80 


Lys 


Gin 


Gly 


Met 


Ser 


Cys 


Glu 


Glu 


Pro 


Ala 


Asp 


He 


Asp 


Tyr 


Asp 


Ser 










85 










90 










95 




Ser 


Arg 


Thr 


Trp 


Val 


He 


Asp 


Lys 


Pro 


Gly 


Leu 


Pro 


Lys 


Thr 


Pro 


Lys 








100 










105 










110 






Gly 


Phe 


Lys 


Arg 


Ser 


Leu 


Val 


Leu 


Arg 


Lys 


Asp 


Tyr 


Ser 


Lys 


Met 


Asp 
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115 120 125 

Thr Tyr Tyr Phe Thr Pro Thr Gly Lys Lys Leu Arg Ser Arg Asn Glu 

130 135 140 

lie Ala Ala Phe Val Glu Ala Asn Pro Glu Phe Arg Asn Ala Pro Leu 
5 145 150 155 160 

Gly Asp Phe Asn Phe Thr Val Pro Lys Val Met Glu Asp Thr Val Pro 

165 170 175 

Pro Asp Pro Lys Leu Gly Ser Pro Phe Pro Ser Thr Thr Thr Thr Thr 
180 185 190 

10 Ser Glu Lys Ser Ser Val Lys Gin Ser His Asn 
195 200 
(2) INFORMATION FOR SEQ ID NO: 217: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

2 0 (A) NAME/KEY: peptide 

(B) LOCATION: 1..186 

(D) OTHER INFORMATION: / Ceres Seq. ID 1397132 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 
Met Lys Glu Glu Glu Glu lie Gly Lys Pro Ala Lys Pro Lys Ala Lys 
25 1 5 10 ^ 15 

Lys Asp Val Ala Pro Gly Arg Leu lie Asp Thr Tyr Ala Ala Gin Cys 

20 25 30 

Asp Asn Cys His Lys Trp Arg Val lie Asp Ser Gin Glu Glu Tyr Glu 
35 40 45 

30 Asp He Arg Ser Lys Met Leu Glu Asp Pro Phe Asn Cys Gin Lys Lys 
50 55 60 

Gin Gly Met Ser Cys Glu Glu Pro Ala Asp He Asp Tyr Asp Ser Ser 
65 70 75 80 

Arg Thr Trp Val He Asp Lys Pro Gly Leu Pro Lys Thr Pro Lys Gly 
35 85 ~ 90 95 

Phe Lys Arg Ser Leu Val Leu Arg Lys Asp Tyr Ser Lys Met Asp Thr 

100 105 " 110 

Tyr Tyr Phe Thr Pro Thr Gly Lys Lys Leu Arg Ser Arg Asn Glu He 
115 120 ' 125 

4 0 Ala Ala Phe Val Glu Ala Asn Pro Glu Phe Arg Asn Ala Pro Leu Gly 

130 135 . 140 

Asp Phe Asn Phe Thr Val Pro Lys Val Met Glu Asp Thr Val Pro Pro 
145 150 ~ 155 160 

Asp Pro Lys Leu Gly Ser Pro Phe Pro Ser Thr Thr Thr Thr Thr Ser 
45 165 170 175 

Glu Lys Ser Ser Val Lys Gin Ser His Asn 

180 185 
(2) INFORMATION FOR SEQ ID NO:218: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..133 

(D) OTHER INFORMATION: / Ceres Seq. ID 1397133 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 
60 Met Leu Glu Asp Pro Phe Asn Cys Gin Lys Lys Gin Gly Met Ser Cys 
1 5 " 10 ~ ~ 15 

Glu Glu Pro Ala Asp He Asp Tyr Asp Ser Ser Arg Thr Trp Val He 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



427 

20 25 30 

Asp Lys Pro Gly Leu Pro Lys Thr Pro Lys Gly Phe Lys Arg Ser Leu 

35 4 0 4 5 

Val Leu Arg Lys Asp Tyr Ser Lys Met Asp Thr Tyr Tyr Phe Thr Pro 

50 "* 55 60 

Thr Gly Lys Lys Leu Arg Ser Arg Asn Glu lie Ala Ala Phe Val Glu 
65 70 75 80 

Ala Asn Pro Glu Phe Arg Asn Ala Pro Leu Gly Asp Phe Asn Phe Thr 

8 5 90 " 95 

Val Pro Lys Val Met Glu Asp Thr Val Pro Pro Asp Pro Lys Leu Gly 

100 105 110 

Ser Pro Phe Pro Ser Thr Thr Thr Thr Thr Ser Glu Lys Ser Ser Val 

115 120 125 

Lys Gin Ser His Asn 
130 

(2) INFORMATION FOR SEQ ID NO:219: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..784 

(D) OTHER INFORMATION: / Ceres Seq. ID 1398004 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:219: 
actcatcaac caaaacaaaa cataaaaaaa caagtggaag ctttaaaacg agagggagag 
agcaaaaatg gcgacgtcgg gaacgtacgt gacggaagtt ccgctaaaag gatcggccga 
gaaatactac aagaggtgga agaacgagaa ccatgtcttc cctgatgcta tcggccacca 
catccaaaat gttaccgttc acgaaggcga acatgactct cacgggtcta tcaggagttg 
gaactacaca tgggatggaa aggaggaggt gttcaaggag agaagagaga tagacgatga 
gaccaaaacg ttgacgttaa gaggacttga gggtcacgtg atggagcagc tcaaagtgta 
cgacgtcgtc taccaattca ttcccaaatc tgaggatacc tgcatcggca aaatcacttt 
aatatgggag aagcgcaacg atgattcccc agaaccaagc ggctacataa aattcgtcaa 
gagcttggtt gctgacatgg gaaaccacgt tagcaaaact taatcatcat tcccacagtc 
gtcgtcgtcg tcatcatcat catcatcatc atcatcatca tcatcatcat catcatcatc 
atcatcatca ctatctcgat ttataagtta agatgttttc agtataataa atggggtctt 
gtggatcgtt catttctatg tgtaaaccgt ttggttctgt atgatgcttc gatatattgt 
tatgttcatg atcatatgtc gggttcgata taatgattct taagattaat ttactacaca 
tttc 

(2) INFORMATION FOR SEQ ID NO: 220: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..151 

(D) OTHER INFORMATION: / Ceres Seq. ID 1398005 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 
Met Ala Thr Ser Gly Thr Tyr Val Thr Glu Val Pro Leu Lys Gly Ser 
1 5 10 15 

Ala Glu Lys Tyr Tyr Lys Arg Trp Lys Asn Glu Asn His Val Phe Pro 

20 " " 25 30 

Asp Ala lie Gly His His lie Gin Asn Val Thr Val His Glu Gly Glu 

35 4 0 4 5 

His Asp Ser His Gly Ser lie Arg Ser Trp Asn Tyr Thr Trp Asp Gly 

50 55 60 

Lys Glu Glu Val Phe Lys Glu Arg Arg Glu lie Asp Asp Glu Thr Lys 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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65 










70 










75 










80 


Thr 


Leu 


Thr 


Leu 


Arg 
85 


Gly 


Leu 


Glu 


Gly 


His 
90 


Val 


Met 


Glu 


Gin 


Leu 
95 


Lys 


Val 


Tyr 


Asp 


Val 
100 


Val 


Tyr 


Gin 


Phe 


He 
105 


Pro 


Lys 


Ser 


Glu 


Asp 
110 


Thr 


Cys 


He 


Gly 


Lys 
115 


He 


Thr 


Leu 


He 


Trp 
120 


Glu 


Lys 


Arg 


Asn 


Asp 
125 


Asp 


Ser 


Pro 


Glu 


Pro 
130 


Ser 


Gly 


Tyr 


He 


Lys 
135 


Phe 


Val 


Lys 


Ser 


Leu 
140 


Val 


Ala 


Asp 


Met 


Gly 


Asn 


His 


Val 


Ser 


Lys 


Thr 





















145 150 
(2) INFORMATION FOR SEQ ID NO: 221: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..389 

(D) OTHER INFORMATION: / Ceres Seq. ID 1399370 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



aaaaactttt tcagagtcgg aggtttcaga cagagaggaa gaagatgaag cggacggtga 6 

tggacaacgc gattcgatca tcggtggtgg tgttgggatc attggccttt ggttacttgt 12 

cactggagct tggttacaag cctttccttg aaaaggctga acaatacgaa agatctcttc 18 

agtcttctca acaacatcaa caacaagatg aacaagaaga agcgaggtgg gacaatagca 24 

atgtcgaggg gtgggaggag aaaaggtagt cttataatcg gtttcaatac agacatatca 30 

aagtgtgtaa ggaagaagaa ccttatcaaa attttcttct taatcaaccg taagttcaat 36 



ctgttaacta tcaaccatgg cctgttttt 
(2) INFORMATION FOR SEQ ID NO: 222: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..8 8 

(D) OTHER INFORMATION: / Ceres Seq. ID 1399371 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:222 












Lys 


Leu 


Phe 


Gin 


Ser 


Arg 


Arg 


Phe 


Gin 


Thr 


Glu 


Arg 


Lys 


Lys 


Met 


Lys 


1 








5 










10 










15 




Arg 


Thr 


Val 


Met 


Asp 


Asn 


Ala 


He 


Arg 


Ser 


Ser 


Val 


Val 


Val 


Leu 


Gly 








20 










25 










30 






Ser 


Leu 


Ala 


Phe 


Gly 


Tyr 


Leu 


Ser 


Leu 


Glu 


Leu 


Gly 


Tyr 


Lys 


Pro 


Phe 






35 










40 










45 








Leu 


Glu 


Lys 


Ala 


Glu 


Gin 


Tyr 


Glu 


Arg 


Ser 


Leu 


Gin 


Ser 


Ser 


Gin 


Gin 




50 










55 










60 










His 


Gin 


Gin 


Gin 


Asp 


Glu 


Gin 


Glu 


Glu 


Ala 


Arg 


Trp 


Asp 


Asn 


Ser 


Asn 


65 










70 










75 










80 


Val 


Glu 


Gly 


Trp 


Glu 
85 


Glu 


Lys 


Arg 


















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:223: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(A) NAME/KEY: peptide 

(B) LOCATION: 1..7 4 
(D) OTHER INFORMATION: 



429 



/ Ceres Seq. ID 1399372 





(xi) 


i SEQUENCE DESCRIPTION: SEQ 


ID NO:223: 










Met 


Lys 


Arg Thr 


Val 


Met 


Asp Asn Ala 


lie Arg 


Ser 


Ser 


Val 


Val 


Val 


1 






5 






10 








15 




Leu 


Gly 


Ser Leu 


Ala 


Phe 


Gly Tyr Leu 


Ser Leu 


Glu 


Leu 


Gly 


Tyr 


Lys 






20 






2 5 








30 






Pro 


Phe 


Leu Glu 


Lys 


Ala 


Glu Gin Tyr 


Glu Arg 


Ser 


Leu 


Gin 


Ser 


Ser 






35 






40 






45 








Gin 


Gin 


His Gin 


Gin 


Gin 


Asp Glu Gin 


Glu Glu 


Ala 


Arg 


Trp 


Asp 


Asn 




50 








55 




60 










Ser 


Asn 


Val Glu 


Gly 


Trp 


Glu Glu Lys 


Arg 












65 








70 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:224 : 















(i) 



(ii) 
(ix) 



(xi) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

( B ) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..69 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 224: 



ID 1399373 



Met 
1 



Asp Asn Ala lie Arg Ser Ser Val Val Val Leu Gly Ser Leu Ala 



10 



15 



Phe Gly Tyr Leu Ser Leu Glu Leu Gly Tyr Lys Pro Phe Leu Glu Lys 



20 



25 



30 



Ala Glu Gin Tyr Glu Arg Ser Leu Gin Ser Ser Gin Gin His Gin Gin 



35 



40 



45 



(ii) 
(ix) 



Gin Asp Glu Gin Glu Glu Ala Ara Trp Asp Asn Ser Asn Val Glu Gly 

50 55 60 

Trp Glu Glu Lys Arg 
65 

(2) INFORMATION FOR SEQ ID NO: 225: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..745 

(D) OTHER INFORMATION: / Ceres Seq. ID 1425147 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
atttcttcgt ctctgtctcc aaaatcgaat caaaatctct aaagtttcaa tttttttgtt 
ctgttctttt ttttttttta aagaatggct tcaatttctg caactttgcc ttcgccattg 
ttactcacac agagaaaatc caatctcaca tcgattcaaa aactcccatt ttctctaact 
cgaggtacga atgatctttc tccattatct cttactcgaa accctagcag catcagtctg 
atggtgaaat ctagtggaga aagctcagat tcatcgactg atctcgacgt tgttagtacg 
attcagaatg tttgggataa gtctgaagat aggttaggtc ttattggttt gggttttgct 
ggtattgtag ctctttgggc atcattgaat ctcatcacgg caattgacaa attgcccgtt 
atctcgagcg gattcgaact agttggtatc ttgttctcca cgtggttcac atatcgatat 
ctcttgttca aaccggacag acaggagctt tcgaaaattg tcaagaaatc agtagcggat 
atacttggcc agtgaacctt gtgtgtgtga taatacttca tctttggaag atgatttgtt 
tgcaagtttg taaaattaca tgacagggtg gttgttgttt ctagtccaat aatgtcatgc 
atttgaaacc tgtaaatact ttattgttgg tttttggttg tgagcaaaat caatcttttc 
taatttcaaa gattctcttt tatgt 
(2) INFORMATION FOR SEQ ID NO: 226: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..156 

(D) OTHER INFORMATION: / Ceres Seq. ID 1425148 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 



Met 


Ala 


Ser 


He 


Ser 


Ala 


Thr 


Leu 


Pro 


Ser 


Pro 


Leu 


Leu 


Leu 


Thr 


Gin 


1 








5 










10 










15 




Arg 


Lys 


Ser 


Asn 


Leu 


Thr 


Ser 


He 


Gin 


Lys 


Leu 


Pro 


Phe 


Ser 


Leu 


Thr 








20 










25 










30 






Arg 


Gly 


Thr 


Asn 


Asp 


Leu 


Ser 


Pro 


Leu 


Ser 


Leu 


Thr 


Arg 


Asn 


Pro 


Ser 






35 










40 










45 








Ser 


He 


Ser 


Leu 


Met 


Val 


Lys 


Ser 


Ser 


Gly 


Glu 


Ser 


Ser 


Asp 


Ser 


Ser 




50 










55 










60 










Thr 


Asp 


Leu 


Asp 


Val 


Val 


Ser 


Thr 


He 


Gin 


Asn 


Val 


Trp 


Asp 


Lys 


Ser 


65 










70 










75 










80 


Glu 


Asp 


Arg 


Leu 


Gly 


Leu 


He 


Gly 


Leu 


Gly 


Phe 


Ala 


Gly 


He 


T 7 - 1 

v a 1 


Ala 










85 










90 










95 




Leu 


Trp 


Ala 


Ser 


Leu 


Asn 


Leu 


He 


Thr 


Ala 


He 


Asp 


Lys 


Leu 


Pro 


Val 








100 










105 










110 






He 


Ser 


Ser 


Gly 


Phe 


Glu 


Leu 


Val 


Gly 


He 


Leu 


Phe 


Ser 


Thr 


Trp 


Phe 






115 










120 










125 








Thr 


Tyr 


Arg 


Tyr 


Leu 


Leu 


Phe 


Lys 


Pro 


Asp 


Arg 


Gin 


Glu 


Leu 


Ser 


Lys 




130 










135 










140 










He 


Val 


Lys 


Lys 


Ser 


Val 


Ala 


Asp 


He 


Leu 


Gly 


Gin 











145 150 155 

(2) INFORMATION FOR SEQ ID NO: 227: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..104 

(D) OTHER INFORMATION: / Ceres Seq. ID 1425149 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 



Met 


Val 


Lys 


Ser 


Ser 


Gly 


Glu 


Ser 


Ser 


Asp 


Ser 


Ser 


Thr 


Asp 


Leu 


Asp 


1 








5 










10 










15 




Val 


Val 


Ser 


Thr 
20 


He 


Gin 


Asn 


Val 


Trp 

25 


Asp 


Lys 


Ser 


Glu 


Asp 
30 


Arg 


Leu 


Gly 


Leu 


He 
35 


Gly 


Leu 


Gly 


Phe 


Ala 
40 


Gly 


He 


Val 


Ala 


Leu 
45 


Trp 


Ala 


Ser 


Leu 


Asn 
50 


Leu 


He 


Thr 


Ala 


He 

55 


Asp 


Lys 


Leu 


Pro 


Val 
60 


He 


Ser 


Ser 


Gly 


Phe 


Glu 


Leu 


Val 


Gly 


He 


Leu 


Phe 


Ser 


Thr 


Trp 


Phe 


Thr 


Tyr 


A.rg 


Tyr 


65 










70 










75 










80 


Leu 


Leu 


Phe 


Lys 


Pro 
85 


Asp 


Arg 


Gin 


Glu 


Leu 
90 


Ser 


Lys 


He 


Val 


Lys 
95 


Lys 


Ser 


Val 


Ala 


Asp 
100 


He 


Leu 


Gly 


Gin 



















(2) INFORMATION FOR SEQ ID NO: 228: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



WO 00/40695 



PCT/US00/00466 



431 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
5 (B) LOCATION: 1..769 

(D) OTHER INFORMATION: / Ceres Seq. ID 1441102 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
ccacaacaca acttcagctt taagacagct aaaactaaat acaaagaaac 
ttggttacag aggaggtgag agctaaggca gagatgtaca ccggagatga 
10 gagaagacaa agtgtttcct caaggaaata tctatgccca atggtttatt 
gatatagaag aggttgggta tgacagagag tcaggtgttg tatggctgaa 
agcatcactc acaagttcac agagatagat aagcttgtct cctatggaac 
gccattgttg agaccgggaa aatcaagaag ctcaccggag tcaaagccaa 
atttgggtca ctatcaatga gatctatacc gaggaacccc ctaccaagat 
15 acaccgacca cactgtccag gactttcccg gtcacagctt ttatagtccc 
gccaaggagg aacctgccaa agaggaacct gccaaggaga agagcagcga 
gccaaggagg ctgttgcaat caaggaggct gtcgcagtca aagaggcggc 
cagcttcagc agataaaaga gacccgaatt tttaagtgta cttcaagaat 
catggagagt tagtgtttct cactgctcta actatcatgt attatctacc 
20 tgctgcttgt tttacctgta ataagatatc aatatcatcc tgttccact 
(2) INFORMATION FOR SEQ ID NO:229: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
30 (B) LOCATION: 1..197 

(D) OTHER INFORMATION: / Ceres Seq. ID 1441103 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:229: 



Pro 


Gin 


His 


Asn 


Phe 


Ser 


Phe 


Lys 


Thr 


Ala 


Lys 


Thr 


Lys 


T^ - 


Lys 


Glu 


1 








5 










10 










15 




Thr 


Glu 


Met 


Gly 


Leu 


Val 


Thr 


Glu 


Glu 


Val 


Arg 


Ala 


Lys 


Ala 


Glu 


Met 








20 










25 










30 






Tyr 


Thr 


Gly 


Asp 


Glu 


He 


Cys 


Arg 


Glu 


Lys 


Thr 


Lys 


Cys 


Phe 


Leu 


Lys 






35 










40 










45 








Glu 


He 


Ser 


Met 


Pro 


Asn 


Gly 


Leu 


Leu 


Pro 


Leu 


Lvs 


Asp 


He 


Glu 


Glu 




50 










55 










60 










Val 


Gly 


Tyr 


Asp 


Arg 


Glu 


Ser 


Gly Val 


Val 


Trp 


Leu 


Lys 


Gin 


Lys 


Lys 


65 










70 










75 










80 


Ser 


He 


Thr 


His 


Lys 


Phe 


Thr 


Glu 


He 


Asp 


Lys 


Leu 


Val 


Ser 


Tyr 


Gly 










85 










90 










95 




Thr 


Glu 


Val 


Thr 


Ala 


He 


Val 


Glu 


Thr 


Gly 


Lys 


He 


Lys 


Lys 


Leu 


Thr 








100 










105 










110 






Gly 


Val 


Lys 


Ala 


Lys 


Glu 


Leu 


Leu 


He 


Trp 


Val 


Thr 


He 


Asn 


Glu 


He 






115 










120 










125 








Tyr 


Thr 


Glu 


Glu 


Pro 


Pro 


Thr 


Lys 


He 


Thr 


Phe 


Lys 


Thr 


Pro 


Thr 


Thr 




130 










135 










140 










Leu 


Ser 


Arg 


Thr 


Phe 


Pro 


Val 


Thr 


Ala 


Phe 


He 


Val 


Pro 


Glu 


Glu 


Pro 


145 










150 










155 










160 


Ala 


Lys 


Glu 


Glu 


Pro 


Ala 


Lys 


Glu 


Glu 


Pro 


Ala 


Lys 


Glu 


Lys 


Ser 


Ser 










165 










170 










175 




Glu 


Ala 


Thr 


Glu 


Ala 


Lys 


Glu 


Ala 


Val 


Al a 


He 


Lys 


Glu 


Ala 


Val 


Ala 








180 










185 










190 






Val 


Lys 


Glu 


Ala 


Ala 




























195 




























(2) 


INFORMATION 


FOR 


SEQ 


ID NO:230: 

















60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 9 amino acids 

(B) TYPE: amino acid 



agagatgggt 60 

gatatgtaga 120 

accattgaag 180 

gcagaagaag 240 

cgaagtcaca 300 

ggaacttctt 360 

cacgttcaag 420 

agaagaacct 480 

agccaccgag 540 

ctaatcagct 600 

ttgaacaaat 660 

taccacttgt 720 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

5 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 17 9 

(D) OTHER INFORMATION : / Ceres Seq. ID 1441104 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:230: 



Met 


Gly 


Leu 


Val 


Thr 


Glu 


Glu 


Val 


Arg 


Ala 


Lys 


Ala 


Glu 


Met 


Tyr 


Thr 


1 








5 










10 










15 




Gly 


Asp 


Glu 


He 


Cys 


Arg 


Glu 


Lys 


Thr 


Lys 


Cys 


Phe 


Leu 


Lys 


Glu 


He 








20 










25 










30 






Ser 


Met 


Pro 


Asn 


Gly 


Leu 


Leu 


Pro 


Leu 


Lys 


Asp 


He 


Glu 


Glu 


Val 


Gly 






35 










40 










4 5 








Tyr 


Asp 


Arg 


Glu 


Ser 


Gly 


Val 


Val 


Trp 


Leu 


Lys 


Gin 


Lys 


Lys 


Ser 


He 




50 










55 










60 










Thr 


His 


Lys 


Phe 


Thr 


Glu 


He 


Asp 


Lys 


Leu 


Val 


Ser 


Tyr 


Gly 


Thr 


Glu 


65 










70 










75 










80 


Val 


Thr 


Ala 


He 


Val 


Glu 


Thr 


Gly 


Lys 


He 


Lys 


Lys 


Leu 


Thr 


Gly 


Val 










85 










90 










95 




Lys 


Ala 


Lys 


Glu 


Leu 


Leu 


He 


Trp 


Val 


Thr 


He 


Asn 


Glu 


He 


Tyr 


Thr 








100 










105 










110 






Glu 


Glu 


Pro 


Pro 


Thr 


Lys 


He 


Thr 


Phe 


Lys 


Thr 


Pro 


Thr 


Thr 


Leu 


Ser 






115 










120 










125 








Arg 


Thr 


Phe 


Pro 


Val 


Thr 


Ala 


Phe 


He 


Val 


Pro 


Glu 


Glu 


Pro 


Ala 


Lys 




130 










135 










140 










Glu 


Glu 


Pro 


Ala 


Lys 


Glu 


Glu 


Pro 


Ala 


Lys 


Glu 


Lys 


Ser 


Ser 


Glu 


Ala 


145 










150 










155 










160 


Thr 


Glu 


Ala 


Lys 


Glu 


Ala 


Val 


Ala 


He 


Lys 


Glu 


Ala 


Val 


Ala 


Val 


Lys 










165 










170 










175 





Glu Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 231: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..166 

(D) OTHER INFORMATION: / Ceres Seq. ID 1441105 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 



Met 


Tyr 


Thr 


Gly 


Asp 


Glu 


He 


Cys 


Arg 


Glu 


Lys 


Thr 


Lys 


Cys 


Phe 


Leu 


1 








5 










10 










15 




Lys 


Glu 


He 


Ser 


Met 


Pro 


Asn 


Gly 


Leu 


Leu 


Pro 


Leu 


Lys 


Asp 


He 


Glu 








20 










25 










30 






Glu 


Val 


Gly 


Tyr 


Asp 


Arg 


Glu 


Ser 


Gly 


Val 


Val 


Trp 


Leu 


Lys 


Gin 


Lys 






35 










40 










45 








Lys 


Ser 


He 


Thr 


His 


Lys 


Phe 


Thr 


Glu 


He 


Asp 


Lys 


Leu 


Val 


Ser 


Tyr 




50 










55 










60 










Gly 


Thr 


Glu 


Val 


Thr 


Ala 


He 


Val 


Glu 


Thr 


Gly 


Lys 


He 


Lys 


Lys 


Leu 


65 










70 










75 










80 


Thr 


Gly 


Val 


Lys 


Ala 


Lys 


Glu 


Leu 


Leu 


He 


Trp 


Val 


Thr 


He 


Asn 


Glu 










85 










90 










95 




He 


Tyr 


Thr 


Glu 


Glu 


Pro 


Pro 


Thr 


Lys 


He 


Thr 


Phe 


Lys 


Thr 


Pro 


Thr 








100 










105 










110 






Thr 


Leu 


Ser 


Arg 


Thr 


Phe 


Pro 


Val 


Thr 


Ala 


Phe 


He 


Val 


Pro 


Glu 


Glu 






115 










120 










125 








Pro 


Ala 


Lys 


Glu 


Glu 


Pro 


Ala 


Lys 


Glu 


Glu 


Pro 


Ala 


Lys 


Glu 


Lys 


Ser 



130 135 140 
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Ser Glu Ala Thr Glu Ala Lys Glu Ala Val Ala lie Lys Glu Ala Val 
145 150 155 160 

Ala Val Lys Glu Ala Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 232: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) .MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 607 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447480 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



aaaatctcta aagtttcaat ttttt kgttc tgttcttttt ttttttaaag aatggcttca 60 

atttctgcaa ctttgccttc gccattgtta ctcacacaga gaaaatccaa tctcacatcg 120 

attcaaaaac tcccattttc tctaactcga ggtacgaatg atctttctcc attatctctt 180 

actcgaaacc ctagcagcat cagtctgatg gtgaaagcta gtggagaaag ctcagattca 240 

tcgactgatc tcgacgttgt tagtacgatt cagaatgttg caattgacaa attgcccgtt 300 

atctcgagcg gattcgaact agttggtatc ttgrtctcca cgtggttcac atatcgatat 360 

ctcttgttca aaccggacag acaggagctt tcgaaaattg tcaagaaatc agtagcggat 420 

atacttggcc agtgaacctt gtgtgtgtga taatacttca tctttggaag atgatttgtt 480 

tgcaagtttg taaaattaca tgacagggtg gttgttgttt ctagtccaat aatgtcatgc 540 

atttgaaacc tgtaaatact ttattgttgg tttttggttg tgagcaaaat caatcttttc 600 



taattt c 

(2) INFORMATION FOR SEQ ID NO: 233: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 127 

(D) OTHER INFORMATION : / Ceres Seq. ID 1447481 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 



Met 


Ala 


Ser 


He 


Ser 


Ala 


Thr 


Leu 


Pro 


Ser 


Pro 


Leu 


Leu 


Leu 


Thr 


Gin 


1 








5 










10 










15 




Arg 


Lys 


Ser 


Asn 
20 


Leu 


Thr 


Ser 


lie 


Gin 
25 


Lys 


Leu 


Pro 


Phe 


Ser 
30 


Leu 


Thr 


Arg 


Gly 


Thr 

35 


Asn 


Asp 


Leu 


Ser 


Pro 
40 


Leu 


Ser 


Leu 


Thr 


Arg 
45 


Asn 


Pro 


Ser 


Ser 


He 

50 


Ser 


Leu 


Met 


Val 


Lys 
55 


Ala 


Ser 


Gly 


Glu 


Ser 
60 


Ser 


Asp 


Ser 


Ser 


Thr 


Asp 


Leu 


Asp 


Val 


Val 


Ser 


Thr 


He 


Gin 


Asn 


Val 


Ala 


He 


Asp 


Lys 


65 










70 










75 










80 


Leu 


Pro 


Val 


He 


Ser 
85 


Ser 


Gly 


Phe 


Glu 


Leu 
90 


Val 


Gly 


He 


Leu 


Phe 
95 


Ser 


Thr 


Trp 


Phe 


Thr 
100 


Tyr 


Arg 


Tyr 


Leu 


Leu 
105 


Phe 


Lys 


Pro 


Asp 


Arg 
110 


Gin 


Glu 


Leu 


Ser 


Lys 
115 


He 


Val 


Lys 


Lys 


Ser 
120 


Val 


Ala 


Asp 


He 


Leu 
125 


Gly 


Gin 





(2) INFORMATION FOR SEQ ID NO: 234: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..75 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447482 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 
Met Val Lys Ala Ser Gly Glu Ser Ser Asp Ser Ser Thr Asp Leu Asp 
1 5 10 15 

Val Val Ser Thr lie Gin Asn Val Ala lie Asp Lys Leu Pro Val lie 

20 25 30 

Ser Ser Gly Phe Glu Leu Val Gly lie Leu Phe Ser Thr Trp Phe Thr 

35^ 40 45 

Tyr Arg Tyr Leu Leu Phe Lys Pro Asp Arg Gin Glu Leu Ser Lys lie 

50 55 " 60 

Val Lys Lys Ser Val Ala Asp lie Leu Gly Gin 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 235: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 668 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447577 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:235: 
aaaaaaaaac aaaacaaaaa attatattca agagaaaaag gaaaaaatga atttcatctc 60 
cgatcaggta aagaaactct caagctcaac accagaggcc agaccacaac aagccagtcg 120 
aaggaaccga aacagctaca agaccagcta ccaacgccga gctcatggca agtgccaagg 180 
ttgtagctga agctgctcaa gccgcagctc gtaacgaatc agacaaactc gacaagggta 240 
aagtcgccgg agcctctgct gatatct^ag acgctgccga gaaatacggt aagttcgatg 300 
aaaagagtag cactggtcag tacctcga^a aggr.tgagaa gtatctcaac gactacgagt 360 
cgtcacactc caccggtgct ggtggtcctc ctcctccgac gagtcaggct gagccagcaa 420 
gtcagcctga gccggcggct aagaaagacg atgaagagtc tggtggtggg cttggaggtt 480 
atgccaagat ggctcaaggt ttcttgaagt gatttgatct ttaattgttg ttcatcattt 540 
tcgtaataat aaattaaata actagtatcg tttgtgacta gtttatgttg cttcgtttat 600 
gtttatgggg agtgacgagt gagtgtaata acttctggtg atcatgaatc taatccatct 660 
ttgttgtc 

(2) INFORMATION FOR SEQ ID NO: 23 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 62 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447578 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:236: 
Lys Lys Asn Lys Thr Lys Asn Tyr lie Gin Glu Lys Lys Glu Lys Met 
15 10 15 

Asn Phe lie Ser Asp Gin Val Lys Lys Leu Ser Ser Ser Thr Pro Glu 

20 ~ 25 30 

Ala Arg Pro Gin Gin Ala Ser Arg Arg Asn Arg Asn Ser Tyr Lys Thr 

35 40 45 

Ser Tyr Gin Arg Arg Ala His Gly Lys Cys Gin Gly Cys Ser 

50 55 60 

(2) INFORMATION FOR SEQ ID NO: 237: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .115 

(D) OTHER INFORMATION : / Ceres Seq. ID 1447579 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 
10 Met Ala Ser Ala Lys Val Val Ala Glu Ala Ala Gin Ala Ala Ala Arg 
15 10 15 

Asn Glu Ser Asp Lys Leu Asp Lys Gly Lys Val Ala Gly Ala Ser Ala 

20 * 25 30 

Asp lie Leu Asp Ala Ala Glu Lys Tyr Gly Lys Phe Asp Glu Lys Ser 
15 35 40 45 

Ser Thr Gly Gin Tyr Leu Asp Lys Ala Glu Lys Tyr Leu Asn Asp Tyr 

50 " 55 60 

Glu Ser Ser His Ser Thr Gly Ala Gly Gly Pro Pro Pro Pro Thr Ser 
65 70 75 80 

20 Gin Ala Glu Pro Ala Ser Gin Pro Glu Pro Ala Ala Lys Lys Asp Asp 

8 5 90 95 

Glu Glu Ser Gly Gly Gly Leu Gly Gly Tyr Ala Lys Met Ala Gin Glv 
100 105 110 

Phe Leu Lys 
25 115 

{2} INFORMATION FOR SEQ ID NO: 238: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
35 (B) LOCATION: 1 . . 69 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447580 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238: 
Met Lys Arg Val Ala Leu Val Ser Thr Ser Thr Arg Leu Arg Ser lie 
15 10 15 

4 0 Ser Thr Thr Thr Ser Arg His Thr Pro Pro Val Leu Val Val Leu Leu 

20 25 30 

Leu Arg Arg Val Arg Leu Ser Gin Gin Val Ser Leu Ser Arg Arg Leu 

35 " 4 0 4 5 

Arg Lys Thr Met Lys Ser Leu Val Val Gly Leu Glu Val Met Pro Arg 

4 5 5 0 ' 55 60 

Trp Leu Lys Val Ser 
65 

(2) INFORMATION FOR SEQ ID NO: 239: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

5 5 (ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..717 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447922 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 
60 cttcgtctct gtctccaaaa tcgaatcaaa atctctaaag tttcaatttt tttgttctgt 60 
tctttttttt tttttaaaga • atggcttcaa tttctgcaac tttgccttcg ccattgttac 120 
tcacacagag aaaatccaat ctcacatcga ttcaaaaact cccattttct ctaactcgag 180 
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gtacgaatga tctttctcca ttatctctta ctcgaaaccc tagcagcatc agtctgatgg 240 

tgaaagctag tggagaaagc tcagattcat cgactgatct cgacgttgtt agtacgattc 300 

agaatgtttg ggataagtct gaagataggt taggtcttat tggtttgggt tttgctggta 360 

ttgtagctct ttgggcatca ttgaatctca tcacggcaat tgacaaattg cccgttatct 420 

cgagcggatt cgaactagtt ggtatcttgt tctccacgtg gttcacatat cgatatctct 480 

tgttcaaacc ggacagacag gagctttcga aaattgtcaa gaaatcagta gcggatatac 540 

ttggccagtg aaccttgtgt gtgtgataat acttcatctt tggaagatga tttgtttgca 600 

agtttgtaaa attacatgac agggtggttg ttgtttctag tccaataatg tcatgcattt 660 



gaaacctgta aatactttat tgttggtttt tggttgtgag caaaatcaat cttttct 
(2) INFORMATION FOR SEQ ID NO: 240: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..156 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447923 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 



Met 


Ala 


Ser 


He 


Ser 


Ala 


Thr 


Leu 


Pro 


Ser 


Pro 


Leu 


Leu 


Leu 


Thr 


Gin 


1 








5 










10 










15 




Arg 


Lys 


Ser 


Asn 


Leu 


Thr 


Ser 


He 


Gin 


Lys 


Leu 


Pro 


Phe 


Ser 


Leu 


Thr 








20 










25 










30 






Arg 


Gly 


Thr 


Asn 


Asp 


Leu 


Ser 


Pro 


Leu 


Ser 


Leu 


Thr 


Arg 


Asn 


Pro 


Ser 






35 










40 










45 








Ser 


He 


Ser 


Leu 


Met 


Val 


Lys 


Ala 


Ser 


Gly 


Glu 


Ser 


Ser 


Asp 


Ser 


Ser 




50 










55 










60 










Thr 


Asp 


Leu 


Asp 


Val 


Val 


Ser 


Thr 


He 


Gin 


Asn 


Val 


Trp 


Asp 


Lys 


Ser 


65 










70 










75 










80 


Glu 


Asp 


Arg 


Leu 


Gly 


Leu 


He 


Gly 


Leu 


Gly 


Phe 


Ala 


Gly 


He 


Val 


Ala 










85 










90 










95 




Leu 


Trp 


Ala 


Ser 


Leu 


Asn 


Leu 


He 


Thr 


Ala 


He 


Asp 


Lys 


Leu 


Pro 


Val 








100 










105 










110 






He 


Ser 


Ser 


Gly 


Phe 


Glu 


Leu 


Val 


Gly 


He 


Leu 


Phe 


Ser 


Thr 


Trp 


Phe 






115 










120 










125 








Thr 


Tyr 


Arg 


Tyr 


Leu 


Leu 


Phe 


Lys 


Pro 


Asp 


Arg 


Gin 


Glu 


Leu 


Ser 


Lys 




130 










135 










140 










He 


Val 


Lys 


Lys 


Ser 


Val 


Ala 


Asp 


He 


Leu 


Gly 


Gin 











145 150 155 

(2) INFORMATION FOR SEQ ID NO: 241: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 104 

(D) OTHER INFORMATION: / Ceres Seq. ID 1447924 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:241: 
Met Val Lys Ala Ser Gly Glu Ser Ser Asp Ser Ser Thr Asp Leu Asp 
15 10 15 

Val Val Ser Thr He Gin Asn Val Trp Asp Lys Ser Glu Asp Arg Leu 

20 25 30 

Gly Leu He Gly Leu Gly Phe Ala Gly He Val Ala Leu Trp Ala Ser 

35 4 0 4 5 

Leu Asn Leu He Thr Ala He Asp Lys Leu Pro Val He Ser Ser Gly 

5 0 5 5 60 

Phe Glu Leu Val Gly He Leu Phe Ser Thr Trp Phe Thr Tyr Arg Tyr 
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65 70 75 80 

Leu Leu Phe Lys Pro Asp Arg Gin Glu Leu Ser Lys lie Val Lys Lys 

8 5 90 95 

Ser Val Ala Asp lie Leu Gly Gin 
5 100 

(2) INFORMATION FOR SEQ ID NO: 242: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

[A) NAME/KEY : - 
15 (B) LOCATION: 1..65 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448012 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
ggaatttctc actctctcta tctctcttag ccagctctga ccatttcgat ttttttccgg 60 
tgaaaaggga gcagaaacat ggttgtacgt atcagattat cgagatttgg atgcaaaaat 120 
20 cggccatttt ttagggttat ggctgctgat agcagatctc caagagacgg gaagcatctt 180 
gaggtcttag gttacttcaa tcctttgcca ggccaggacg gtggtaagag gatgggtctc 240 
aagttcgatc gaattaagta ctggttatct gttggtgctc agccatcaga cccggttcaa 300 
cgccttcttt tcagatccgg tttacttcct cctcctccaa tggtggctat gggacgtaaa 360 
ggtggagcac gagacacacg cccagttgat ccaatgactg gtcgctatgt ggatgcagag 420 
25 aataaaacag ttaatgccaa tgataaccag cctaaggaag aggattcaga agacaagatt 480 
gcatgattca ttagccttct gtcgtcgtag cttttcaagt tcactttgtt gtcgattata 540 
ttgtgtaatg cagcattaga caactgactt gtttcctttg tttggcgata aacggcaagg 600 
tgtttggcac tttttgcaga aacggcacat attttgcatt gggatatttt aatttt 
(2) INFORMATION FOR SEQ ID NO : 2 4 3 : 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
35 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 135 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448013 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

Met Val Val Arg lie Arg Leu Ser Arg Phe Gly Cys Lys Asn Arg Pro 
15 10 15 

Phe Phe Arg Val Met Ala Ala Asp Ser Arg Ser Pro Arg Asp Gly Lys 
20 25 30 

45 His Leu Glu Val Leu Gly Tyr Phe Asn Pro Leu Pro Gly Gin Asp Gly 
35 40 45 

Gly Lys Arg Met Gly Leu Lys Phe Asp Arg lie Lys Tyr Trp Leu Ser 

50 55 60 

Val Gly Ala Gin Pro Ser Asp Pro Val Gin Arg Leu Leu Phe Arg Ser 
50 65 70 75 80 

Gly Leu Leu Pro Pro Pro Pro Met Val Ala Met Gly Arg Lys Gly Gly 

85 90 95 

Ala Arg Asp Thr Arg Pro Val Asp Pro Met Thr Gly Arg Tyr Val Asp 
100 105 " 110 

55 Ala Glu Asn Lys Thr Val Asn Ala Asn Asp Asn Gin Pro Lys Glu Glu 
115 120 125 

Asp Ser Glu Asp Lys lie Ala 
130 ^ 135 

(2) INFORMATION FOR SEQ ID NO : 2 4 4 : 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 115 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448014 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 





Met 


Ala 


Ala 


Asp 


Ser 


Arg 


Ser 


Pro 


Arg 


Asp 


Gly 


Lys 


His 


Leu 


Glu 


Val 


10 


1 








5 










10 










15 






Leu 


Gly 


Tyr 


Phe 
20 


Asn 


Pro 


Leu 


Pro 


Gly 
25 


Gin 


Asp 


Gly 


Gly 


Lys 
30 


Arg 


Met 




Gly 


Leu 


Lys 
35 


Phe 


Asp 


Arg 


He 


Lys 
40 


Tyr 


Trp 


Leu 


Ser 


Val 
45 


Gly 


Ala 


Gin 


15 


Pro 


Ser 
50 


Asp 


Pro 


Val 


Gin 


Arg 
55 


Leu 


Leu 


Phe 


Arg 


Ser 
60 


Gly 


Leu 


Leu 


Pro 




Pro 


Pro 


Pro 


Met 


Val 


Ala 


Met 


Gly 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Asp 


Thr 




65 










70 










75 










80 




Arg 


Pro 


Val 


Asp 


Pro 


Met 


Thr 


Gly 


Arg 


Tyr 


Val 


Asp 


Ala 


Glu 


Asn 


Lys 


20 










85 










SO 










95 






Thr 


Val 


Asn 


Ala 


Asn 


Asp 


Asn 


Gin 


Pro 


Lys 


Glu 


Glu 


Asp 


Ser 


Glu 


Asp 



100 105 110 



Lys He Ala 
115 

2 5 (2) INFORMATION FOR SEQ ID NO: 24 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 84 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 1448015 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:245 












Met 


Gly 


Leu 


Lys 


Phe 


Asp 


Arg He Lys 


Tyr 


Trp 


Leu 


Ser 


Val 


Gly 


Ala 


1 








5 






10 










15 




Gin 


Pro 


Ser 


Asp 


Pro 


val 


Gin Ara Leu 


Leu 


Phe 


Arg 


Ser 


Gly 


Leu 


Leu 








20 






25 










30 






Pro 


Pro 


Pro 


Pro 


Met 


Val 


Ala Met Gly 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Asp 






35 








40 








45 








Thr 


Arg 


Pro 


Val 


Asp 


Pro 


Met Thr Gly 


Arg 


Tyr 


Val 


Asp 


Ala 


Glu 


Asn 




50 










55 






60 










Lys 


Thr 


Val 


Asn 


Ala 


Asn 


Asp Asn Gin 


Pro 


Lys 


Glu 


Glu 


Asp 


Ser 


Glu 


65 










70 






75 










80 


Asp 


Lys 


He 


Ala 






















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:246: 

















50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 5 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: ~ 

(B) LOCATION: 1 . . 458 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448135 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 

aattttctag ggttttgaag agtgtctctc gtcgccgttg taattcctct gttagcaaat 60 
cgacaaaatg ggtcactcta atgtatggaa ctctcatccg aagaagtacg gtcctggatc 120 
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tcgtttatgc cgtgtgtgcg ggaactcgca cgggctgatc cggaagtatg gtttgaactg 180 
ctgcagacag tgtttccgta gcaacgctaa ggagattgga ttcattaagt accgttaatc 240 
aagcaccaac ttcatgattg atgcttaatg atataaacat gaaggcgtcg atgggtttgg 300 
cttttaagct tttgtagttt ttgaaatttt tacttttgag aaccattgtt attttgggag 360 
5 ttaattaagt tgttgaacct ctcattaagc atgtcttatt ttggattaat gatgttttgg 420 
ctattctcgy atttttgttt tatcagtcaa atttgact 
(2) INFORMATION FOR SEQ ID NO: 24 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

15 (A) NAME / KEY : peptide 

(B) . LOCATION: 1 . . 51 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448136 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
Phe Ser Arg Val Leu Lys Ser Val Ser Arg Arg Arg Cys Asn Ser Ser 
20 1 5 10 15 

Val Ser Lys Ser Thr Lys Trp Val Thr Leu Met Tyr Gly Thr Leu lie 

20 " 25 30 

Arg Arg Ser Thr Val Leu Asp Leu Val Tyr Ala Val Cys Ala Gly Thr 
35 40 45 

25 Arg Thr Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 24 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

35 (A) NAME /KEY : peptide 

(B) LOCATION: 1..56 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448137 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:248: 
Met Gly His Ser Asn Val Trp Asn Ser His Pro Lys Lys Tyr Gly Pro 
40 1 5 10 15 

Gly Ser Arg Leu Cys Arg Val Cys Gly Asn Ser His Gly Leu lie Arg 

20 25 30 

Lys Tyr Gly Leu Asn Cys Cys Arg Gin Cys Phe Arg Ser Asn Ala Lys 
35 "40 4 5 

4 5 Glu lie Gly Phe lie Lys Tyr Arg 
50 55 
(2) INFORMATION FOR SEQ ID NO: 24 9: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

55 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 38 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448138 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 
Met Glu Leu Ser Ser Glu Glu Val Arg Ser Trp lie Ser Phe Met Pro 
60 1 5 10 15 

Cys Val Arg Glu Leu Ala Arg Ala Asp Pro Glu Val Trp Phe Glu Leu 
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Leu Gin Thr Val Phe Pro 
35 

(2) INFORMATION FOR SEQ ID NO: 250: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 603 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448185 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:250: 



gaagaagaga gtatcgttgt gaggtttggg gatcgcgaaa atggaggttc caggttcatc 60 

gaagaagatg atcgcaacgc aagaagagat gtctgcagct aaaatagcac ttggatctag 120 

agatatgtgc gctcatctct tgattccgct caacaaatgt cgtcaggctg agttttacct 180 

tccatggaaa tgtgaagacg agcgtcacgt ttatgagaag tgtgaatacg agcttgttat 240 

ggagagaatg cttgcgatga agaagatccg tgaagaagaa gctttggcta aacagaataa 300 

actacaagga aacgctgctg ttcctcttat ccctaaaact gctaatgctt aggattcgat 360 

tccttctcca atttgccgat tccagattcc gggattctct ggaacrgtga agatggtggg 420 

ggctctgctt ttcaatcttt ttcttcttgc tagtgatgaa aaattgttgc tacattttca 480 

gatgtgaatg ttcaagattc tctatctttt tttttttttt gaactctttt taacttggag 540 

tggtttccca aaaaataaga tgcaaaactc atctttttgt tggttttcta tctttaatct 600 



gtg 

(2) INFORMATION FOR SEQ ID NO: 251: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..103 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448186 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 



Met 


Glu 


Val 


Pro 


Gly 


Ser 


Ser 


Lys 


Lys 


Met 


He 


Ala 


Thr 


Gin 


Glu 


Glu 


1 








5 










10 










15 




Met 


Ser 


Ala 


Ala 


Lys 


He 


Ala 


Leu 


Gly 


Ser 


Arg 


Asp 


Met 


Cys 


Ala 


His 








20 










25 










30 






Leu 


Leu 


He 


Pro 


Leu 


Asn 


Lys 


Cys 


Arg 


Gin 


Ala 


Glu 


Phe 


Tyr 


Leu 


Pro 






35 










40 










45 








Trp 


Lys 


Cys 


Glu 


Asp 


Glu 


Arg 


His 


Val 


Tyr 


Glu 


Lys 


Cys 


Glu 


Tyr 


Glu 




50 










55 










60 










Leu 


Val 


Met 


Glu 


Arg 


Met 


Leu 


Ala 


Met 


Lys 


Lys 


He 


Arg 


Glu 


Glu 


Glu 


65 










70 










75 










80 


Ala 


Leu 


Ala 


Lys 


Gin 


Asn 


Lys 


Leu 


Gin 


Gly 


Asn 


Ala 


Ala 


Val 


Pro 


Leu 










85 










90 










95 




He 


Pro 


Lys 


Thr 


Ala 


Asn 


Ala 





















100 

(2) INFORMATION FOR SEQ ID NO: 252: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..94 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448187 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
Met lie Ala Thr Gin Glu Glu Met Ser Ala Ala Lys lie Ala Leu Gly 
15 10 15 

Ser Arg Asp Met Cys Ala His Leu Leu lie Pro Leu Asn Lys Cys Arg 

20 2 5 30 

Gin Ala Glu Phe Tyr Leu Pro Trp Lys Cys Glu Asp Glu Arg His Val 

35 * 4 0 4 5 

Tyr Glu Lys Cys Glu Tyr Glu Leu Val Met Glu Arg Met Leu Ala Met 

50 55 60 

Lys Lys lie Arg Glu Glu Glu Ala Leu Ala Lys Gin Asn Lys Leu Gin 
65 70 75 80 

Gly Asn Ala Ala Val Pro Leu lie Pro Lys Thr Ala Asn Ala 

85 90 
(2) INFORMATION FOR SEQ ID NO: 253: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 87 

(D) OTHER INFORMATION: / Ceres Seq. ID 1448188 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 



Met 


Ser 


Ala 


Ala 


Lys 


He 


Ala Leu 


Gly 


Ser 


Arg 


Asp 


Met 


Cys 


Ala 


His 


1 








5 








10 










15 




Leu 


Leu 


He 


Pro 


Leu 


Asn 


Lys Cys 


Arg 


Gin 


Ala 


Glu 


Phe 


Tyr 


Leu 


Pro 








20 








25 










30 






Trp 


Lys 


Cys 


Glu 


Asp 


Glu 


Arg His 


Val 


Tyr 


Glu 


Lys 


Cys 


Glu 


Tyr 


Glu 






35 








4 0 










45 








Leu 


Val 


Met 


Glu 


Arg 


Met 


Leu Ala 


Met 


lys 


Lys 


He 


Arg 


Glu 


Glu 


Glu 




50 










55 








60 










Ala 


Leu 


Ala 


Lys 


Gin 


Asn 


Lys Leu 


Gin 


Gly 


Asn 


Ala 


Ala 


Val 


Pro 


Leu 


65 










70 








75 










80 


He 


Pro 


Lys 


Thr 


Ala 
85 


Asn 


Ala 


















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 25 4: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 2034 

(D) OTHER INFORMATION: / Ceres Seq. ID 1450875 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
tttatcaaaa tcgatggctg cttctagatt gtgttccgcg gcggcgatag ctgctgcctt 60 
cacttcaatg tcaatgtctc agaaccgtgc ttacgccgat tcttctcgat tccggttrcc 120 
tttcttttct tcttctcctt ctcctcctcc gtcagattct ccggcgaatc aatcttctac 180 
aaactctagt aagtctaaag cagagcctga tgagcctaaa ggarcgggrr ttgatcctga 240 
ggctcttgag agagctgcta aagctcttag agatatcaat agctctcccc attccaaaca 300 
ggtgtttgat ctcatgagga agcaggagaa aactcggtta gctgaattag cggcagagac 360 
ttctcattac gaagctattc aagcacacaa tgatattggc agacagcaga aattggctga 420 
ggaccagaga aatcttttgc agacacaggc gcaaaccaaa gcgcaaaatc tgcgatatga 480 
ggatgaattg gccagaaaga gacagcagac agatcatgaa gctcagaggc atcataatgt 54 0 

ggaattggtt aagatgcaag aggcgtcttc tatcaggaaa gagaaggcaa aaatcgccac 600 
agaagaacag atccaagctc agcatcgcca aactgagaaa gagagagctg aacttgagcg 660 
agagacgatt cgtgtcaagg ccatggctga agctgaaggt cgggctcatg aagccaaact 720 
tactgaagag caaaacagaa gattgcttat ggaaaggatt aatggtgaaa gagagaagtg 78 0 
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gcttgctgca atcaacacaa tgttcagtca catcgaaggg ggattcagga ccttattaac 840 
tgatcgaaat aagctgatta tgactgttgg aggagctact gcattagctg caggggttta 900 
tacaactcgt gaaggagcta gagttacatg gggttatatt aataggatgc ttggacaacc 960 
gtcacttatt cgagaatctt ccatgcgtag atttccatgg acaggctcag tgtctcaatt 1020 
5 taagaacagg atctcagggg ctgcagcagc ttctgcagca gaaggcaaaa agccgcttga 1080 
taatgtaatt ctccatactt ctttgaagaa acgaatcgag cgtctcgcta gagctacagc 1140 
aaacaccaaa tcccatcaag caccattccg caacatgatg ttttatggac ctcctggtac 1200 
cggtaaaact atggtggcaa gggaaatagc tcggaaatcg ggtctggatt atgctatgat 1260 
gacaggaggt gatgttgctc ccttgggatc acaagctgtt actaaaatcc atcagatatt 1320 

10 tgattgggct aagaaatcga acagaggctt actccttttc attgatgaag ccgatgcttt 1380 
tctatgcgag cgtaacagca cttacatgag tgaggctcaa cgtagtgctc tgaacgcttt 1440 
gctcttccga actggtgatc aatctcggga cattgttctt gtcttggcta caaacagacc 1500 
tggagatctc gatagtgcgg ttacagacag gattgatgaa gtcattgagt tcccacttcc 1560 
aggggaagaa gaacgtttca agcttctcaa tctctatctc aacaaatatc taaagatggg 1620 

15 tgataacaac gaagacacaa aaccgaaatg gagtcatttg tttaagaagc tgtcacagaa 1680 
gattaccgtt gaagaagacr taactgataa agtgatttct gaggctgcaa agaagacaga 1740 
aggattctct ggccgtgaga ttgcaaagct tgtggctgga gtacaagctg gagtgtacgg 1800 
acgagcggat tgtgttttgg attcacagct ttttaaagag attgttgaat ataaggttga 1860 
agaacatcac cgaaggcata tgcttgcttc tgaaggtttt cagccattac tcttctctta 1920 

20 gtcatttgga ttttgattat acatgtcatt tacttgatca gaaagaagaa tctgattgtt 1980 
ttaaaatgag tcctaaaatt gaatttttag attaaacatg ttaaagagtt ttac 
(2) INFORMATION FOR SEQ ID NO: 255: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 amino acids 
25 ( B ) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

30 (A) NAME /KEY : peptide 

(B) LOCATION: 1..639 

(D) OTHER INFORMATION: / Ceres Seq. ID 1450876 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
Leu Ser Lys Ser Met Ala Ala Ser Arg Leu Cys Ser Ala Ala Ala lie 
35 1 5 10 15 

Ala Ala Ala Phe Thr Ser Met Ser Met Ser Gin Asn Arg Ala Tyr Ala 

20 25 30 

Asp Ser Ser Arg Phe Arg Phe Pro Phe Phe Ser Ser Ser Pro Ser Pro 
35 4 0 4 5 

4 0 Pro Pro Ser Asp Ser Pro Ala Asn Gin Ser Ser Thr Asn Ser Ser Lys 
50 55 60 

Ser Lys Ala Glu Pro Asp Glu Pro Lys Gly Ser Gly Phe Asp Pro Glu 
65 70 75 80 

Ala Leu Glu Arg Ala Ala Lys Ala Leu Arg Asp lie Asn Ser Ser Pro 
4 5 85 90 95 

His Ser Lys Gin Val Phe Asp Leu Met Arg Lys Gin Glu Lys Thr Arg 

100 105 110 

Leu Ala Glu Leu Ala Ala Glu Thr Ser His Tyr Glu Ala lie Gin Ala 
115 120 125 

50 His Asn Asp lie Gly Arg Gin Gin Lys Leu Ala Glu Asp Gin Arg Asn 
130 135 140 

Leu Leu Gin Thr Gin Ala Gin Thr Lys Ala Gin Asn Leu Arg Tyr Glu 
145 150 155 160 

Asp Glu Leu Ala Arg Lys Arg Gin Gin Thr Asp His Glu Ala Gin Arg 
55 * 165 170 175 

His His Asn Val Glu Leu Val Lys Met Gin Glu Ala Ser Ser lie Arg 

180 185 190 

Lys Glu Lys Ala Lys lie Ala Thr Glu Glu Gin lie Gin Ala Gin His 
195 200 205 

60 Arg Gin Thr Glu Lys Glu Arg Ala Glu Leu Glu Arg Glu Thr lie Arg 
210 215 220 

Val Lys Ala Met Ala Glu Ala Glu Gly Arg Ala His Glu Ala Lys Leu 
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225 










230 










235 










240 


Thr 


Glu 


Glu 


Gin 


Asn 


Arg 


Arg 


Leu 


Leu 


Met 


Glu 


Arg 


He 


Asn 


Gly 


Glu 










245 










250 










255 




Arg 


Glu 


Lys 


Trp 


Leu 


Ala 


Ala 


He 


Asn 


Thr 


Met 


Phe 


Ser 


His 


He 


Glu 








260 










265 










270 






Gly 


Gly 


Phe 


Arg 


Thr 


Leu 


Leu 


Thr 


Asp 


Arg 


Asn 


Lys 


Leu 


He 


Met 


Thr 






275 










280 










285 








Val 


Gly 


Gly 


Ala 


Thr 


Ala 


Leu 


Ala 


Ala 


Gly 


Val 


Tyr 


Thr 


Thr 


Arg 


Glu 




290 










295 










300 










Gly 


Ala 


Arg 


Val 


Thr 


Trp 


Gly 


Tyr 


He 


Asn 


Arg 


Met 


Leu 


Gly 


Gin 


Pro 


305 










310 










315 










320 


Ser 


Leu 


He 


Arg 


Glu 


Ser 


Ser 


Met 


Arg 


Arg 


Phe 


Pro 


Trp 


Thr 


Gly 


Ser 










325 










330 










335 




Val 


Ser 


Gin 


Phe 


Lys 


Asn 


Arg 


He 


Ser 


Gly 


Ala 


Ala 


Ala 


Ala 


Ser 


Ala 








340 










345 










350 






Ala 


Glu 


Gly 


Lys 


Lys 


Pro 


Leu 


Asp 


Asn 


Val 


He 


Leu 


His 


Thr 


Ser 


Leu 






355 










360 










365 








Lys 


Lys 


Arg 


He 


Glu 


Arg 


Leu 


Ala 


Arg 


Ala 


Thr 


Ala 


Asn 


Thr 


Lys 


Ser 




370 










375 










380 










His 


Gin 


Ala 


Pro 


Phe 


Arg 


Asn 


Met 


Met 


Phe 


Tyr 


Gly 


Pro 


Pro 


Gly 


Thr 


385 










390 










395 










400 


Gly 


Lys 


Thr 


Met 


Val 


Ala 


Arg 


Glu 


He 


Ala 


Arg 


Lys 


Ser 


Gi y 


Leu 


Asp 










405 










410 










415 




Tyr 


Ala 


Met 


Met 


Thr 


Gly 


Gly 


Asp 


Val 


Ala 


Pro 


Leu 


Gly 


Ser 


Gin 


Ala 








420 










425 










430 






Val 


Thr 


Lys 


He 


His 


Gin 


He 


Phe 


Asp 


Trp 


Ala 


Lys 


Lys 


Ser 


Asn 


Arg 






435 










440 










445 








Gly 


Leu 


Leu 


Leu 


Phe 


He 


Asp 


Glu 


Ala 


Asp 


Ala 


Phe 


Leu 


Cys 


Glu 


Arg 




450 










455 










460 










Asn 


Ser 


Thr 


Tyr 


Met 


Ser 


Glu 


Ala 


Gin 


Arg 


Ser 


Ala 


Leu 


Asn 


Ala 


Leu 


465 










470 










475 










480 


Leu 


Phe 


Arg 


Thr 


Gly 


Asp 


Gin 


Ser 


Arg 


Asp 


He 


Val 


Leu 


Val 


Leu 


Ala 










4 85 










490 










495 




Thr 


Asn 


Arg 


Pro 


Gly 


Asp 


Leu 


Asp 


Ser 


Ala 


Val 


Thr 


Asp 


Arg 


He 


Asp 








500 










505 










510 






Glu 


Val 


He 


Glu 


Phe 


Pro 


Leu 


Pro 


Gly 


Glu 


Glu 


Glu 


Arg 


Phe 


Lys 


Leu 






515 










520 










525 








Leu 


As n 


Leu 


Tyr 


Leu 


Asn 


Lys 


Tyr 


Leu 


Lys 


Met 


Gly 


Asp 


Asn 


Asn 


Glu 




530 










535 










540 










Asp 


Thr 


Lys 


Pro 


Lys 


Trp 


Ser 


His 


Leu 


Phe 


Lys 


Lys 


Leu 


Ser 


Gin 


Lys 


545 










550 










555 










560 


He 


Thr 


Val 


Glu 


Glu 


Asp 


Leu 


Thr 


Asp 


Lys 


Val 


He 


Ser 


Glu 


Ala 


Ala 










565 










570 










575 




Lys 


Lys 


Thr 


Glu 


Gly 


Phe 


Ser 


Gly 


Arg 


Glu 


He 


Ala 


Lys 


Leu 


Val 


Ala 








580 










585 










590 






Gly 


Val 


Gin 


Ala 


Gly 


Val 


Tyr 


Gly 


Arg 


Ala 


Asp 


Cys 


Val 


Leu 


Asp 


Ser 






595 










600 










605 








Gin 


Leu 


Phe 


Lys 


Glu 


He 


Val 


Glu 


Tyr 


Lys 


Val 


Glu 


Glu 


His 


His 


Arg 




610 










615 










620 










Arg 


His 


Met 


Leu 


Ala 


Ser 


Glu 


Gly 


Phe 


Gin 


Pro 


Leu 


Leu 


Phe 


Ser 





625 630 635 

(2) INFORMATION FOR SEQ ID NO: 256: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 amino acids 

(B) TYPE: amino acici 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..635 

(D) OTHER INFORMATION: / Ceres Seq. ID 1450877 



WO 00/40695 



PCT/US00/00466 



444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 
Met Ala Ala Ser Arg Leu Cys Ser Ala Ala Ala lie Ala Ala Ala Phe 
15 10 15 

Thr Ser Met Ser Met Ser Gin Asn Arg Ala Tyr Ala Asp Ser Ser Arg 
5 20 25 30 

Phe Arg Phe Pro Phe Phe Ser Ser Ser Pro Ser Pro Pro Pro Ser Asp 

35 4 0 4 5 

Ser Pro Ala Asn Gin Ser Ser Thr Asn Ser Ser Lys Ser Lys Ala Glu 
50 55 60 

10 Pro Asp Glu Pro Lys Gly Ser Gly Phe Asp Pro Glu Ala Leu Glu Arg 
65 70 75 80 

Ala Ala Lys Ala Leu Arg Asp lie Asn Ser Ser Pro His Ser Lys Gin 

8 5 ^ 90 95 

Val Phe Asp Leu Met Arg Lys Gin Glu Lys Thr Arg Leu Ala Glu Leu 
15 ~ 100 105 110 

Ala Ala Glu Thr Ser His Tyr Glu Ala He Gin Ala His Asn Asp He 

115 120 125 

Gly Arg Gin Gin Lys Leu Ala Glu Asp Gin Arg Asn Leu Leu Gin Thr 
130 ~ 135 140 

20 Gin Ala Gin Thr Lys Ala Gin Asn Leu Arg Tyr Glu Asp Glu Leu Ala 
145 150 155 160 

Arg Lys Arg Gin Gin Thr Asp His Glu Ala Gin Arg His His Asn Val 

165 17 0 17 5 

Glu Leu Val Lys Met Gin Glu Ala Ser Ser He Arg Lys Glu Lys Ala 
25 180 185 190 

Lys He Ala Thr Glu Glu Gin He Gin Ala Gin His Arg Gin Thr Glu 

195 200 205 

Lys Glu Arg Ala Glu Leu Glu Arg Glu Thr He Arg Val Lys Ala Met 
210 215 220 

30 Ala Glu Ala Glu Gly Arg Ala His Glu Ala Lys Leu Thr Glu Glu Gin 
225 230 235 240 

Asn Arg Arg Leu Leu Met Glu Arg He Asn Gly Glu Arg Glu Lys Trp 

245 250 255 

Leu Ala Ala He Asn Thr Met Phe Ser His He Glu Gly Gly Phe Arg 
35 260 265 270 

Thr Leu Leu Thr Asp Arg Asn Lys Leu He Met Thr Val Gly Gly Ala 

275 280 285 

Thr Ala Leu Ala Ala Gly Val Tyr Thr Thr Arg Glu Gly Ala Arg Val 
290 295 300 

4 0 Thr Trp Gly Tyr He Asn Arg Met Leu Gly Gin Pro Ser Leu He Arg 
305 " " 310 315 320 

Glu Ser Ser Met Arg Arg Phe Pro Trp Thr Gly Ser Val Ser Gin Phe 

325 330 335 

Lys Asn Arg He Ser Gly Ala Ala Ala Ala Ser Ala Ala Glu Gly Lys 
45 340 ' 345 350 

Lys Pro Leu Asp Asn Val He Leu His Thr Ser Leu Lys Lys Arg He 

355 360 365 

Glu Arg Leu Ala Arg Ala Thr Ala Asn Thr Lys Ser His Gin Ala Pro 
370 375 380 

50 Phe Arg Asn Met Met Phe Tyr Gly Pro Pro Gly Thr Gly Lys Thr Met 
385 390 395 400 

Val Ala Arg Glu He Ala Arg Lys Ser Gly Leu Asp Tyr Ala Met Met 

405 410 415 

Thr Gly Gly Asp Val Ala Pro Leu Gly Ser Gin Ala Val Thr Lys He 
55 ~ 420 425 430 

His Gin He Phe Asp Trp Ala Lys Lys Ser Asn Arg Gly Leu Leu Leu 

4 35 4 40 4 45 

Phe He Asp Glu Ala Asp Ala Phe Leu Cys Glu Arg Asn Ser Thr Tyr 
450 455 460 

60 Met Ser Glu Ala Gin Arg Ser Ala Leu Asn Ala Leu Leu Phe Arg Thr 
465 470 475 480 

Gly Asp Gin Ser Arg Asp He Val Leu Val Leu Ala Thr Asn Arg Pro 
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485 










490 










495 




Gly Asp 


Leu 


Asp 


Ser 


Ala 


Val 


Thr 


Asp 


Arg 


He 


Asp 


Glu 


Val 


He 


Glu 








500 










505 










510 






Phe 


Pro 


Leu 


Pro 


Gly 


Glu 


Glu 


Glu 


Arg 


Phe 


Lys 


Leu 


Leu 


Asn 


Leu 


Tyr 






515 










520 










525 








Leu 


Asn 


Lys 


Tyr 


Leu 


Lys 


Met 


Gly 


Asp 


Asn 


Asn 


Glu 


Asp 


Thr 


Lys 


Pro 




530 










535 










540 










Lys 


Trp 


Ser 


His 


Leu 


Phe 


Lys 


Lys 


Leu 


Ser 


Gin 


Lys 


He 


Thr 


Val 


Glu 


545 










550 










555 










560 


Glu 


Asp 


Leu 


Thr 


Asp 


Lys 


Val 


He 


Ser 


Glu 


Ala 


Ala 


Lys 


Lys 


Thr 


Glu 










565 










570 










575 




Gly 


Phe 


Ser 


Gly 


Arg 


Glu 


He 


Ala 


Lys 


Leu 


Val 


Ala 


Gly Val 


Gin 


Ala 








580 










585 










590 






Gly 


Val 


Tyr 


Gly 


Arg 


Ala 


Asp 


Cys 


Val 


Leu 


Asp 


Ser 


Gin 


Leu 


Phe 


Lys 






595 










600 










605 








Glu 


He 


Val 


Glu 


Tyr 


Lys 


Val 


Glu 


Glu 


His 


His 


Arg 


Arg 


His 


Met 


Leu 




610 










615 










620 










Ala 


Ser 


Glu 


Gly 


Phe 


Gin 


Pro 


Leu 


Leu 


Phe 


Ser 













625 630 635 

2 0 (2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 617 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..617 

30 (D) OTHER INFORMATION: / Ceres Seq. ID 1450878 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 





Met 


Ser 


Met 


Ser 


Gin 


Asn 


Arg 


Ala 


Tyr 


Ala 


Asp 


Ser 


Ser 


Arg 


Phe 


Arg 




1 








5 










10 










15 






Phe 


Pro 


Phe 


Phe 


Ser 


Ser 


Ser 


Pro 


Ser 


Pro 


Pro 


Pro 


Ser 


Asp 


Ser 


Pro 


35 








20 










25 










30 








Ala 


Asn 


Gin 


Ser 


Ser 


Thr 


Asn 


Ser 


Ser 


Lys 


Ser 


Lys 


Ala 


Glu 


Pro 


Asp 








35 










40 










45 










Glu 


Pro 


Lys 


Gly 


Ser 


Gly 


Phe 


Asp 


Pro 


Glu 


Ala 


Leu 


Glu 


Arg 


Ala 


Ala 






50 










55 










60 










40 


Lys 


Ala 


Leu 


Arg 


Asp 


lie 


Asn 


Ser 


Ser 


Pro 


His 


Ser 


Lys 


Gin 


Val 


Phe 




65 










70 










75 










80 




Asp 


Leu 


Met 


Arg 


Lys 


Gin 


Glu 


Lys 


Thr 


Arg 


Leu 


Ala 


Glu 


Leu 


Ala 


Ala 












85 










90 










95 






Glu 


Thr 


Ser 


His 


Tyr 


Glu 


Ala 


lie 


Gin 


Ala 


His 


Asn 


Asp 


lie 


Gly 


Arg 


45 








100 










105 










110 








Gin 


Gin 


Lys 


Leu 


Ala 


Glu 


Asp 


Gin 


Arg 


Asn 


Leu 


Leu 


Gin 


Thr 


Gin 


Ala 








115 










120 










125 










Gin 


Thr 


Lys 


Ala 


Gin 


Asn 


Leu 


Arg 


Tyr 


Glu 


Asp 


Glu 


Leu 


Ala 


Arg 


Lys 






130 










135 










140 










50 


Arg 


Gin 


Gin 


Thr 


Asp 


His 


Glu 


Ala 


Gin 


Arg 


His 


His 


Asn 


Val 


Glu 


Leu 




145 










150 










155 










160 




Val 


Lys 


Met 


Gin 


Glu 


Ala 


Ser 


Ser 


lie 


Arg 


Lys 


Glu 


Lys 


Ala 


Lys 


He 












165 










170 










175 






Ala 


Thr 


Glu 


Glu 


Gin 


lie 


Gin 


Ala 


Gin 


His 


Arg 


Gin 


Thr 


Glu 


Lys 


Glu 


55 








180 










185 










190 








Arg 


Ala 


Glu 


Leu 


Glu 


Arg 


Glu 


Thr 


He 


Arg 


Val 


Lys 


Ala 


Met 


Ala 


Glu 








195 










200 










205 










Ala 


Glu 


Gly 


Arg 


Ala 


His 


Glu 


Ala 


Lys 


Leu 


Thr 


Glu 


Glu 


Gin 


Asn 


Arg 






210 










215 










220 










60 


Arg 


Leu 


Leu 


Met 


Glu 


Arg 


lie 


Asn 


Gly 


Glu 


Arg 


Glu 


Lys 


Trp 


Leu 


Ala 




225 










230 










235 










240 




Ala 


lie 


Asn 


Thr 


Met 


Phe 


Ser 


His 


lie 


Glu 


Gly 


Gly 


Phe 


Arg 


Thr 


Leu 
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245 250 255 

Leu Thr Asp Arg Asn Lys Leu lie Met Thr Val Gly Gly Ala Thr Ala 

2 60 2 65 27 0 

Leu Ala Ala Gly Val Tyr Thr Thr Arg Glu Gly Ala Arg Val Thr Trp 
5 275 280 285 

Gly Tyr lie Asn Arg Met Leu Gly Gin Pro Ser Leu lie Arg Glu Ser 

290 295 300 

Ser Met Arg Arg Phe Pro Trp Thr Gly Ser Val Ser Gin Phe Lys Asn 
305 310 315 320 

10 Arg lie Ser Gly Ala Ala Ala Ala Ser Ala Ala Glu Gly Lys Lys Pro 

325 330 335 

Leu Asp Asn Val lie Leu His Thr Ser Leu Lys Lys Arg lie Glu Arg 

340 345 350 

Leu Ala Arg Ala Thr Ala Asn Thr Lys Ser His Gin Ala Pro Phe Arg 
15 355 360 365 

Asn Met Met Phe Tyr Gly Pro Pro Gly Thr Gly Lys Thr Met Val Ala 

370 375 380 

Arg Glu lie Ala Arg Lys Ser Gly Leu Asp Tyr Ala Met Met Thr Gly 
385 " 390 395 400 

20 Gly Asp Val Ala Pro Leu Gly Ser Gin Ala Val Thr Lys lie His Gin 

405 410 415 

lie Phe Asp Trp Ala Lys Lys Ser Asn Arg Gly Leu Leu Leu Phe lie 

420 ~ 425 430 

Asp Glu Ala Asp Ala Phe Leu Cys Glu Arg Asn Ser Thr Tyr Met Ser 
25 435 440 445 

Glu Ala Gin Arg Ser Ala Leu Asn Ala Leu Leu Phe Arg Thr Gly Asp 

450 ~ 455 460 

Gin Ser Arg Asp lie Val Leu Val Leu Ala Thr Asn Arg Pro Gly Asp 
465 " 470 475 ~ 480 

30 Leu Asp Ser Ala Val Thr Asp Arg lie Asp Glu Val lie Glu Phe Pro 

485 490 495 

Leu Pro Gly Glu Glu Glu Arg Phe Lys Leu Leu Asn Leu Tyr Leu Asn 

500 505 510 

Lys Tyr Leu Lys Met Gly Asp Asn Asn Glu Asp Thr Lys Pro Lys Trp 
35 " 515 ' " 520 525 

Ser His Leu Phe Lys Lys Leu Ser Gin Lys lie Thr Val Glu Glu Asp 

530 ~ 535 540 

Leu Thr Asp Lys Val lie Ser Glu Ala Ala Lys Lys Thr Glu Gly Phe 
545 550 555 560 

4 0 Ser Gly Arg Glu lie Ala Lys Leu Val Ala Gly Val Gin Ala Gly Val 

565 570 575 

Tyr Gly Arg Ala Asp Cys Val Leu Asp Ser Gin Leu Phe Lys Glu lie 

580 585 590 

Val Glu Tyr Lys Val Glu Glu His His Arg Arg His Met Leu Ala Ser 
45 595 " 600 605 

Glu Gly Phe Gin Pro Leu Leu Phe Ser 

610 615 
(2) INFORMATION FOR SEQ ID NO: 258: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DRA (genomic) 
55 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 478 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459191 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
60 aaccctcttg gaaagagtct caacacttgc agagaaaaag aacaaggaag atcccggaaa 60 
atggcaacgg cgattgtacg ttcagctctt tcccgagcag tgactcgcgc agtccgaaga 120 
catccgtcgc tcctaagcga aacttttcct cttccgccgg ccatgacgat gcttatgaag 180 
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ctgcgaagtg ggagaagata acttatctgg gtattgctag ttgcactgct ctagctgtct 240 
atgttttatc caagggccat catcacggcg aagacaagga gtttccttgg ggtccggatg 300 
gtctgtttga ggtgaagcac aacaaagagc actgagtctt gcgtggtcat aataacgtct 360 
tctaggttta tttgaaaggc taaaatgttt taccgtattt gttctcaccg tttgtcaacg 420 
5 atttgctact ccaatctctt ttcttttgtt gggaaataaa agttaatact ttgcttgg 
(2) INFORMATION FOR SEQ ID NO: 259: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
15 (B) LOCATION: 1 . . 6 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459192 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:259: 



Asn 


Pro 


Leu 


Gly 


Lys 


Ser 


Leu 


Asn 


Thr 


Cys 


Arg 


Glu 


Lys 


Glu 


Gin 


Gly 


1 








5 










10 










15 




Arg 


Ser 


Arg 


Lys 


Met 


Ala 


Thr 


Ala 


He 


Val 


Arg 


Ser 


Ala 


Leu 


Ser 


Arg 








20 










25 










30 






Ala 


Val 


Thr 


Arg 


Ala 


Val 


Arg 


Arg 


His 


Pro 


Ser 


Leu 


Leu 


Ser 


Glu 


Thr 






35 










40 










45 








Phe 


Pro 


Leu 


Pro 


Pro 


Ala 


Met 


Thr 


Met 


Leu 


Met 


Lys 


Leu 


Arg 


Ser 


Gly 



25 50 55 60 

Arg Arg 
65 

(2) INFORMATION FOR SEQ ID NO:260: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

35 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 110 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459193 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 



Pro 


Ser 


Trp 


Lys 


Glu 


Ser 


Gin 


His 


Leu 


Gin 


Arg 


Lys 


Arg 


Thr 


Arg 


Lys 


1 








5 










10 










15 




He 


Pro 


Glu 


Asn 


Gly 


Asn 


Gly 


Asp 


Cys 


Thr 


Phe 


Ser 


Ser 


Phe 


Pro 


Ser 








20 










25 










30 






Ser 


Asp 


Ser 


Arg 


Ser 


Pro 


Lys 


Thr 


Ser 


Val 


Ala 


Pro 


Lys 


Arg 


Asn 


Phe 






35 










40 










45 








Ser 


Ser 


Ser 


Ala 


Gly 


His 


Asp 


Asp 


Ala 


Tyr 


Glu 


Ala 


Ala 


Lys 


Trp 


Glu 




50 










55 










60 










Lys 


He 


Thr 


Tyr 


Leu 


Gly 


He 


Ala 


Ser 


Cys 


Thr 


Ala 


Leu 


Ala 


Val 


Tyr 


65 










70 










75 










80 


Val 


Leu 


Ser 


Lys 


Gly 


His 


His 


His 


Gly 


Glu 


Asp 


Lys 


Glu 


Phe 


Pro 


Trp 










85 










90 










95 




Gly 


Pro 


Asp 


Gly 


Leu 


Phe 


Glu 


Val 


Lys 


His 


Asn 


Lys 


Glu 


His 












100 










105 










110 






(2) 


INFORMATION 


FOR 


SEQ 


ID ] 


MO:261: 

















55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 
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(B) LOCATION: 1..741 

(D) OTHER INFORMATION; / Ceres Seq. ID 1461848 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261: 

attttagtac attgttgacc atctttttcg tatagactac tatctctgat ctcttgcgag 60 

ttaagtcagt aactaggaaa attcagaagc gctctcaatc tcaaaaatat ccatggcggc 120 

gattacagaa tttctaccaa aagagtacgg atatgtcgtt ctcgtcctcg tcttctactg 180 

tttcctcaac ctctggatgg gtgctcaagt cggcagagct cgcaaaaggt acaacgtccc 240 

gtatccaact ctatatgcaa tagaatcaga aaacaaagat gctaagctct tcaactgtgt 300 

tcagagagga catcaaaact ctttagagat gatgccaatg tatttcatac tgatgatcct 360 

cggtgggatg aagcaccctt gtatctgtac tggccttggt ttgctttaca acgttagccg 420 

attcttctac tttaaaggtt atgctactgg agatcccatg aagcgtctta cgatcgggaa 480 

atacggtttc ttggggttgc taggtctgat garatgtacc atctcgtttg gtgtcactct 540 

gatccttgct taagctactc gtttctgggg ttaatgattc tctggtttgc tcgaagaata 600 

tagaaccaat gcttgtaagc tgtccacaaa acttgtgtaa tactttagag tttgtcactt 660 

ttaaaagttt gtaataaatc atggcttcat agaacagttg aaatttcaca tccgtagacg 720 



ttaataaaga tttgaattat g 
(2) I N FORMAT I ON FOR SEQ ID NO: 262: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .14 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1461849 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 



Met 


Ala 


Ala 


He 


Thr 


Glu 


Phe 


Leu 


Pro 


Lys 


Glu 


Tyr 


Gly 


Tyr 


Val 


Val 


1 








5 










10 










15 




Leu 


Val 


Leu 


Val 
20 


Phe 


Tyr 


Cys 


Phe 


Leu 
25 


Asn 


Leu 


Trp 


Met 


Gly 
30 


Ala 


Gin 


Val 


Gly 


Arg 
35 


Ala 


Arg 


Lys 


Arg 


Tyr 
40 


Asn 


Val 


Pro 


Tyr 


Pro 
45 


Thr 


Leu 


Tyr 


Ala 


He 
50 


Glu 


Ser 


Glu 


Asn 


Lys 
55 


Asp 


Ala 


Lys 


Leu 


Phe 
60 


Asn 


Cys 


Val 


Gin 


Arg 


Gly 


His 


Gin 


Asn 


Ser 


Leu 


Glu 


Met 


Met 


Pro 


Met 


Tyr 


Phe 


He 


Leu 


65 










70 










75 










80 


Met 


He 


Leu 


Gly 


Gly 
85 


Met 


Lys 


His 


Pro 


Cys 
90 


He 


Cys 


Thr 


Gly 


Leu 
95 


Gly 


Leu 


Leu 


Tyr 


Asn 
100 


Val 


Ser 


Arg 


Phe 


Phe 
105 


Tyr 


Phe 


Lys 


Gly 


Tyr 
110 


Ala 


Thr 


Gly 


Asp 


Pro 
115 


Met 


Lys 


Arg 


Leu 


Thr 
120 


He 


Gly 


Lys 


Tyr 


Gly 
125 


Phe 


Leu 


Gly 


Leu 


Leu 
130 


Gly 


Leu 


Met 


He 


Cys 
135 


Thr 


He 


Ser 


Phe 


Gly 
140 


Val 


Thr 


Leu 


He 



Leu Ala 
145 

(2) INFORMATION FOR SEQ ID NO:263: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..118 

(D) OTHER INFORMATION: / Ceres Seq. ID 1461850 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO:263: 
Met Gly Ala Gin Val Gly Arg Ala Arg Lys Arg Tyr Asn Val Pro Tyr 
15 10 15 
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Pro 


Thr 


Leu 


Tyr 
20 


Ala 


He 


Glu 


Ser 


Glu 
25 


Asn 


Lys 


Asp 


Ala 


Lys 
30 


Leu 


Phe 


Asn 


Cys 


Val 
35 


Gin 


Arg 


Gly 


His 


Gin 
40 


Asn 


Ser 


Leu 


Glu 


Met 

45 


Met 


Pro 


Met 


Tyr 


Phe 
50 


He 


Leu 


Met 


He 


Leu 
55 


Gly 


Gly 


Met 


Lys 


His 
60 


Pro 


Cys 


He 


Cys 


Thr 


Gly 


Leu 


Gly 


Leu 


Leu 


Tyr 


Asn 


Val 


Ser 


Arg 


Phe 


Phe 


Tyr 


Phe 


Lys 


65 










70 










75 










80 


Gly 


Tyr 


Ala 


Thr 


Gly 
85 


Asp 


Pro 


Met 


Lys 


Arg 
90 


Leu 


Thr 


lie 


Gly 


Lys 
95 


Tyr 


Gly 


Phe 


Leu 


Gly 
100 


Leu 


Leu 


Gly 


Leu 


Met 
105 


He 


Cys 


Thr 


He 


Ser 
110 


Phe 


Gly 


Val 


Thr 


Leu 
115 


He 


Leu 


Ala 






















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:264: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..74 

(D) OTHER INFORMATION: / Ceres Seq. ID 1461851 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:264 












Met 


Met 


Pro Met 


Tyr 


Phe 


He Leu Met 


He 


Leu 


Gly 


Gly 


Met 


Lys 


His 


1 






5 






10 










15 




Pro 


Cys 


He Cys 


Thr 


Gly 


Leu Gly Leu 


Leu 


Tyr 


Asn 


Val 


Ser 


Arg 


Phe 






20 






25 










30 






Phe 


Tyr 


Phe Lys 


Gly 


Tyr 


Ala Thr Gly 


Asp 


Pro 


Met 


Lys 


Arg 


Leu 


Thr 






35 






40 








45 








He 


Gly 


Lys Tyr 


Gly 


Phe 


Leu Gly Leu 


Leu 


Gly 


Leu 


Met 


He 


Cys 


Thr 




50 








55 






60 










He 


Ser 


Phe Gly 


Val 


Thr 


Leu He Leu 


Ala 














65 








70 


















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:265: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 4 69 

(D) OTHER INFORMATION: / Ceres Seq. ID 1472772 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 



atatagaata taaccatatt ccggacatga agatcgtgac attggtactc gtcgtcttcg 60 

tcatactttc gacatcattc ccggctgcca tcaaagccga agacacggga gatacaggaa 120 

atgtgggagt gacatgtgac gcaaggcagc ttcagccttg ccttgccgcg attacgggag 180 

gaggacaacc ctcgggtgca tgttgtgcaa agcttacaga gcaacagtca tgcctatgtg 240 

gtttcgctaa gaaccctgcg ttcgcacagt acattagctc tccaaacgct cgcaaagtgc 300 

tccttgcttg caatgttgct tatcccactt gttgaaactt atctagattt tataaataaa 360 

taaacgaaag gaaataaatt acattatatc aaacgttatg atacaattca accgtttgtg 420 



ttaatgtact ggcttacatg gttaaataaa gtttaatttc ttggttgcc 
(2) INFORMATION FOR SEQ ID NO: 266: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..110 

5 (D) OTHER INFORMATION: / Ceres Seq. ID 1472773 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 



He 


Glu 


Tyr 


Asn 


His 


He 


Pro 


Asp 


Met 


Lys 


He 


Val 


Thr 


Leu 


Val 


Leu 


1 








5 










10 










15 




Val 


Val 


Phe 


Val 


He 


Leu 


Ser 


Thr 


Ser 


Phe 


Pro 


Ala 


Ala 


He 


Lys 


Ala 








20 










25 










30 






Glu 


Asp 


Thr 


Gly 


Asp 


Thr 


Gly 


Asn 


Val 


Gly 


Val 


Thr 


Cys 


Asp 


Ala 


Arg 






35 










40 










45 








Gin 


Leu 


Gin 


Pro 


Cys 


Leu 


Ala 


Ala 


He 


Thr 


Gly 


Gly 


Gly 


Gin 


Pro 


Ser 




50 










55 










60 










Gly 


Ala 


Cys 


Cys 


Ala 


Lys 


Leu 


Thr 


Glu 


Gin 


Gin 


Ser 


Cys 


Leu 


Cys 


Gly 


65 










70 










75 










80 


Phe 


Ala 


Lys 


Asn 


Pro 


Ala 


Phe 


Ala 


Gin 


Tyr 


He 


Ser 


Ser 


Pro 


Asn 


Ala 










85 










90 










95 




Arg 


Lys 


Val 


Leu 


Leu 


Ala 


Cys 


Asn 


Val 


Ala 


Tyr 


Pro 


Thr 


Cys 












100 










105 










110 






(2) 


INFORMATION 


FOR 


SEQ 


id ; 


NO: 267 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

( B ) TYPE: amino acid 
25 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
30 (B) LOCATION: 1..102 

(D) OTHER INFORMATION: / Ceres Seq. ID 1472774 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 





Met 


Lys 


He 


Val 


Thr 


Leu 


Val 


Leu 


Val 


Val 


Phe 


Val 


He 


Leu 


Ser Thr 




1 








5 










10 










15 


35 


Ser 


Phe 


Pro 


Ala 

20 


Ala 


He 


Lys 


Ala 


Glu 
25 


Asp 


Thr 


Gly 


Asp 


Thr 

30 


Gly Asn 




Val 


Gly 


Val 
35 


Thr 


Cys 


Asp 


Ala 


Arg 
40 


Gin 


Leu 


Gin 


Pro 


Cys 
45 


Leu 


Ala Ala 




He 


Thr 


Gly 


Gly 


Gly 


Gin 


Pro 


Ser 


Gly 


Ala 


Cys 


Cys 


Ala 


Lys 


Leu Thr 


40 




50 










55 










60 










Glu 


Gin 


Gin 


Ser 


Cys 


Leu 


Cys 


Gly 


Phe 


Ala 


Lys 


Asn 


Pro 


Ala 


Phe Ala 




65 










70 










75 








80 




Gin 


Tyr 


He 


Ser 


Ser 
85 


Pro 


Asn 


Ala 


Arg 


Lys 
90 


Val 


Leu 


Leu 


Ala 


Cys Asn 
95 


45 


Val 


Ala 


Tyr 


Pro 
100 


Thr 


Cys 






















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 268 : 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
55 (A) NAME /KEY : - 

(B) LOCATION: 1..1056 

(D) OTHER INFORMATION: / Ceres Seq. ID 1533352 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:268: 
aaaaaaaaaa accaaagcaa aaaaatggct ttgaaactca cttctccgcc ttcagttttc 60 
60 tcacaatcaa ggagattatc ttcttcttcg ttaattccga taaggtcaaa atccacattc 120 
accggatttc gatcgagaac cggtgtttat ttaagcaaaa cgacggcgct tcagtcgtct 180 
acaaaactga gtgtggcggc ggagagtcct gcggcgacaa ttgcgacgga tgattggggg 240 
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15 



20 



25 
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aaagtatcgg cggttctgtt tgatatggac ggtgtgcttt gtaacagtga agatctttct 300 
agacgcgccg ccgtggatgt ttttacggag atgggagttg aagtcactgt ggacgatttc 360 
gttcctttta tgggaacagg tgaagccaag tttttaggag gtgttgcttc agtcaaagaa 420 
gttaaaggat ttgatccaga tgcagctaaa aagagattct ttgaaatata tctcgataag 480 
tatgcgaagc cagaatctgg gattggattt ccaggagcat tggagcttgt tactgagtgt 540 
aagaacaaag gccttaaagt cgctgttgca tctagtgctg accgtatcaa agttgatgcg 600 
aatctgaaag ctgctggttt gtctttgacc atgtttgatg ccattgtttc agcagatgcc 660 
tttgagaatt tgaaaccagc tccagatatt ttcctggctg ctgcaaagat cttaggtgtg 720 
cctaccagcg agtgtgttgt tattgaagat gcgcttgctg gagtccaagc cgcacaagct 780 
gcgaacatga gatgtatagc cgtaaaaact actttatctg aagcaattct taaggatgct 840 
ggtccttcaa tgatacgaga cgatattgga aacatctcaa tcaatgacat tctcactggt 900 
ggctcagatt ctaccagtat gtagtctcaa agaaattcga tggaaaatat cgttcttttc 960 
atgtgtattt tatttcttgt ttactccttt tgaaaacttt tgaataaagg ggctttcttt 1020 
gtaacgagat tacacattta aaacaatctt ttctgt 
(2) INFORMATION FOR SEQ ID NO; 2 69: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
fix) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 



(xi) 



peptide 
1 . . 307 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO:269: 



ID 1533353 





Lys 


Lys 


Lys 


Asn 


Gin 


Ser 


Lys 


Lys 


Met 


Ala 


Leu 


Lys 


Leu 


Thr 


Ser 


Pro 




1 








5 










10 










15 






Pro 


Ser 


Val 


Phe 


Ser 


Gin 


Ser 


Arg 


Arg 


Leu 


Ser 


Ser 


Ser 


Ser 


Leu 


He 


30 








20 










25 










30 








Pro 


He 


Arg 


Ser 


Lys 


Ser 


Thr 


Phe 


Thr 


Gly 


Phe 


Arg 


Ser 


Arg 


Thr 


Gly 








35 










40 










45 










Val 


Tyr 


Leu 


Ser 


Lys 


Thr 


Thr 


Ala 


Leu 


Gin 


Ser 


Ser 


Thr 


Lys 


Leu 


Ser 






50 










55 










60 










35 


Val 


Ala 


Ala 


Glu 


Ser 


Pro 


Ala 


Ala 


Thr 


He 


Ala 


Thr 


Asp 


Asp 


Trp 


Gly 




65 










70 










75 










80 




Lys 


Val 


Ser 


Ala 


Val 


Leu 


Phe 


Asp 


Met 


Asp 


Gly 


Val 


Leu 


Cys 


Asn 


Ser 












85 










90 










95 






Glu 


Asp 


Leu 


Ser 


Arg 


Arg 


Ala 


Ala 


Val 


Asp 


Val 


Phe 


Thr 


Glu 


Met 


Gly 


40 








100 










105 










110 








Val 


Glu 


Val 


Thr 


Val 


Asp 


Asp 


Phe 


Val 


Pro 


Phe 


Met 


Gly 


Thr 


Gly 


Glu 








115 










120 










125 










Ala 


Lys 


Phe 


Leu 


Gly 


Gly 


Val 


Ala 


Ser 


Val 


Lys 


Glu 


Val 


Lys 


Gly 


Phe 






130 










135 










140 










45 


Asp 


Pro 


Asp 


Ala 


Ala 


Lys 


Lys 


Arg 


Phe 


Phe 


Glu 


He 


Tyr 


Leu 


Asp 


Lys 




145 










150 










155 










160 




Tyr 


Ala 


Lys 


Pro 


Glu 


Ser 


Gly 


He 


Gly 


Phe 


Pro 


Gly 


Ala 


Leu 


Glu 


Leu 












165 










170 










175 






Val 


Thr 


Glu 


Cys 


Lys 


Asn 


Lys 


Gly 


Leu 


Lys 


Val 


Ala 


Val 


Ala 


Ser 


Ser 


50 








180 










185 










190 








Ala 


Asp 


Arg 


He 


Lys 


Val 


Asp 


Ala 


Asn 


Leu 


Lys 


Ala 


Ala 


Gly 


Leu 


Ser 








195 










200 










205 










Leu 


Thr 


Met 


Phe 


Asp 


Ala 


He 


Val 


Ser 


Ala 


Asp 


Ala 


Phe 


Glu 


Asn 


Leu 






210 










215 










220 










55 


Lys 


Pro 


Ala 


Pro 


Asp 


He 


Phe 


Leu 


Ala 


Ala 


Ala 


Lys 


He 


Leu 


Gly 


Val 




225 










230 










235 










240 




Pro 


Thr 


Ser 


Glu 


Cys 


Val 


Val 


He 


Glu 


Asp 


Ala 


Leu 


Ala 


Gly 


Val 


Gin 












245 










250 










255 






Ala 


Ala 


Gin 


Ala 


Ala 


Asn 


Met 


Arg 


Cys 


He 


Ala 


Val 


Lys 


Thr 


Thr 


Leu 


60 








260 










265 










270 








Ser 


Glu 


Ala 


He 


Leu 


Lys 


Asp 


Ala 


Gly 


Pro 


Ser 


Met 


He 


Arg 


Asp 


Asp 








275 










280 










285 









WO 00/40695 



PCI7US00/00466 



452 

lie Gly Asn lie Ser lie Asn Asp lie Leu Thr Gly Gly Ser Asp Ser 

290 295 300 

Thr Ser Met 
305 

5 (2) INFORMATION FOR SEQ ID NC:270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..2 99 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 1533354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 
Met Ala Leu Lys Leu Thr Ser Pro Pro Ser Val Phe Ser Gin Ser Arg 
1 5 10 15 

Arg Leu Ser Ser Ser Ser Leu lie Pro lie Arg Ser Lys Ser Thr Phe 
20 20 25 30 

Thr Gly Phe Arg Ser Arg Thr Gly Val Tyr Leu Ser Lys Thr Thr Ala 

35 40 45 

Leu Gin Ser Ser Thr Lys Leu Ser Val Ala Ala Glu Ser Pro Ala Ala 
50 55 60 

25 Thr lie Ala Thr Asp Asp Trp Gly Lys Val Ser Ala Val Leu Phe Asp 
65 70 75 80 

Met Asp Gly Val Leu Cys Asn Ser Glu Asp Leu Ser Arg Arg Ala Ala 

8 5 90 95 

Val Asp Val Phe Thr Glu Met Gly Val Glu Val Thr Val Asp Asp Phe 
30 1 100 105 110 

Val Pro Phe Met Gly Thr Gly Glu Ala Lys Phe Leu Gly Gly Val Ala 

115 120 125 

Ser Val Lys Glu Val Lys Gly Phe Asp Pro Asp Ala Ala Lys Lys Arg 
130 135 140 

35 Phe Phe Glu lie Tyr Leu Asp Lys Tyr Ala Lys Pro Glu Ser Gly lie 
145 150 155 160 

Gly Phe Pro Gly Ala Leu Glu Leu Val Thr Glu Cys Lys Asn Lys Gly 

165 170 175 

Leu Lys Val Ala Val Ala Ser Ser Ala Asp Arg lie Lys Val Asp Ala 
40 ^ 180 185 190 

Asn Leu Lys Ala Ala Gly Leu Ser Leu Thr Met Phe Asp Ala lie Val 

195 200 205 

Ser Ala Asp Ala Phe Glu Asn Leu Lys Pro Ala Pro Asp lie Phe Leu 
210 215 220 

45 Ala Ala Ala Lys lie Leu Gly Val Pro Thr Ser Glu Cys Val Val lie 
225 230 235 240 

Glu Asp Ala Leu Ala Gly Val Gin Ala Ala Gin Ala Ala Asn Met Arg 

245 250 255 

Cys lie Ala Val Lys Thr Thr Leu Ser Glu Ala lie Leu Lys Asp Ala 
50 2 60 2 65 27 0 

Gly Pro Ser Met lie Arg Asp Asp lie Gly Asn He Ser He Asn Asp 

275 280 285 

He Leu Thr Gly Gly Ser Asp Ser Thr Ser Met 
290 295 
5 5 (2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
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(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 219 

(D) OTHER INFORMATION: / Ceres Seq. ID 1533355 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 



Met 


Asp 


Gly 


Val 


Leu 


Cys 


Asn 


Ser 


Glu 


Asp 


Leu 


Ser 


Arg 


Arg 


Ala 


Ala 


1 








5 










1 0 










15 




Val 


Asp 


Val 


Phe 


Thr 


Glu 


Met 


Gly 


Val 


Glu 


Val 


Thr 


Val 


Asp 


Asp 


Phe 








20 










25 










30 






Val 


Pro 


Phe 


Met 


Gly 


Thr 


Gly 


Glu 


Ala 


Lys 


Phe 


Leu 


Gly 


Gly 


Val 


Ala 






35 










40 










45 








Ser 


Val 


Lys 


Glu 


Val 


Lys 


Gly 


Phe 


Asp 


Pro 


Asp 


Ala 


Ala 


Lys 


Lys 


Arg 




50 










55 










60 










Phe 


Phe 


Glu 


He 


Tyr 


Leu 


Asp 


Lys 


Tyr 


Ala 


Lys 


Pro 


Glu 


Ser 


Gly 


He 


65 










70 










75 










80 


Gly 


Phe 


Pro 


Gly 


Ala 


Leu 


Glu 


Leu 


Val 


Thr 


Glu 


Cys 


Lys 


Asn 


Lys 


Gly 










85 










90 










95 




Leu 


Lys 


Val 


Ala 


Val 


Ala 


Ser 


Ser 


Ala 


Asp 


Arg 


He 


Lys 


Val 


Asp 


Ala 








100 










105 










110 






Asn 


Leu 


Lys 


Ala 


Ala 


Gly 


Leu 


Ser 


Leu 


Thr 


Met 


Phe 


Asp 


Ala 


He 


Val 






115 










12 0 










125 








Ser 


Ala 


Asp 


Ala 


Phe 


Glu 


Asn 


Leu 


Lys 


Pro 


Ala 


Pro 


Asp 


He 


Phe 


Leu 




130 










135 










140 










Ala 


Ala 


Ala 


Lys 


He 


Leu 


Gly 


Val 


Pro 


Thr 


Ser 


Glu 


Cys 


Val 


Val 


He 


145 










150 










155 










160 


Glu 


Asp 


Ala 


Leu 


Ala 


Gly 


Val 


Gin 


Ala 


Ala 


Gin 


Ala 


Ala 


Asn 


Met 


Arg 










165 










170 










175 




Cys 


He 


Ala 


Val 


Lys 


Thr 


Thr 


Leu 


Ser 


Glu 


Ala 


He 


Leu 


Lys 


Asp 


Ala 








180 










185 










190 






Gly 


Pro 


Ser 


Met 


He 


Arg 


Asp 


Asp 


He 


Gly 


Asn 


He 


Ser 


He 


Asn 


Asp 






195 










200 










205 








He 


Leu 


Thr 


Gly 


Gly 


Ser 


Asp 


Ser 


Thr 


Ser 


Met 













210 215 
(2) INFORMATION FOR SEQ ID NO: 272: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1420 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 1420 

(D) OTHER INFORMATION: / Ceres Seq. ID 1534544 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 



aaagattctt ggtctccgat ccagtcacct gaagattctc ggagcttctt ttgaccacac 60 

actttcaatg gcggatgctg tgaacgctca aactccatcg ctctccgagc aatatcattt 120 

ggagaaagaa gtgaagcaag acacaagtgc aaagcctgtt gaagtgaaag aggtggcacc 180 

agaagttact acacaagctg aagaggttaa gacggagcaa gctaaggaag aatctcctgt 240 

tgaggaagcg gtttctgtag ttgaagagaa gtctgaatct gctcctgaat caacggaagt 300 

ggcttctgag gctcctgctg cagcggaaga caatgctgaa gagactcctg ctgctgctga 360 

agaaaataat gacgaaaacg ctagtgaaga agttgctgaa gaaacccctg atgagatcaa 420 

gcttgagaca gctcctgctt gatctcatca ggggtttctt tttttttttt ttttttgaat 480 

tttttcacca agtgttcgtt cacggcattg cttgtcctcc agtatgagta tctccggtgc 540 

tgcagttggt tcaggcagaa atttaagaag agcggtagag tttgggaaaa ctcatgtggt 600 

taggcctaaa gggaaacatc aagcaactat tgtctggtta catgggcttg gggacaatgg 660 

ctcgagctgg tcccagcttt tggagaccct tccccttcca aatatcaaat ggatttgccc 720 

gactgctcct tctcaaccaa taagtttatt tggtggtttt ccctccacag cttggtttga 780 

tgttgtggac atcaatgaag atggacctga tgatatggaa ggattggatg tggctgctgc 840 

acatgttgca aatctgttgt cgaatgagcc tgctgacatt aaattaggtg ttggaggatt 900 

cagcatgggt gcggcgacat ctctatattc tgcaacttgt tttgctctcg gtaaatatgg 960 

aaatggcaat ccatacccta tcaatttaag cgcaatcata ggcttaagcg gctggcttcc 1020 

ttgtgcaaag acattggctg gcaaactaga agaggaacag atcaagaacc gagctgcatc 1080 
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gttacccatt gttgtctgtc atggaaaagc tgatgatgtg gtaccgrtca agtttgggga 1140 
gaaatcttca caggctttgc tttcaaatgg gtttaagaag gtgaccttca aaccttacag 1200 
tgcacttggt caccacacaa tcccacagga gttggatgag ttgtgcgcat ggttgacatc 1260 
cacgctcagc ctcgaaggtt gatacttcct atgatgtagc tttctgatga aaacccttca 1320 
actcttgaga gtttgattga attggatgtt tcaggatttc acaatgtttt cattggaata 1380 
tttgtgtaag acacattcgt ctagtagcaa atcttctcct 
(2) INFORMATION FOR SEQ ID NO: 27 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 
-(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..14 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1534545 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 



Lys 


lie 


Leu 


Gly 


Leu 


Arg 


Ser 


Ser 


His 


Leu 


Lys 


He 


Leu 


Gly 


Ala 


Ser 


1 








5 










10 










15 




Phe 


Asp 


His 


Thr 


Leu 


Ser 


Met 


Ala 


Asp 


Ala 


Val 


Asn 


Ala 


Gin 


Thr 


Pro 








20 










25 










30 






Ser 


Leu 


Ser 


Glu 


Gin 


Tyr 


His 


Leu 


Glu 


Lys 


Glu 


Val 


Lys 


Gin 


Asp 


Thr 






35 










40 










45 








Ser 


Ala 


Lys 


Pro 


Val 


Glu 


Val 


Lys 


Glu 


Val 


Ala 


Pro 


Glu 


Val 


Thr 


Thr 




50 










55 










60 










Gin 


Ala 


Glu 


Glu 


Val 


Lys 


Thr 


Glu 


Gin 


Ala 


Lys 


Glu 


Glu 


Ser 


Pro 


Val 


65 










70 










75 










80 


Glu 


Glu 


Ala 


Val 


Ser 


Val 


Val 


Glu 


Glu 


Lys 


Ser 


Glu 


Ser 


Ala 


Pro 


Glu 










85 










90 










95 




Ser 


Thr 


Glu 


Val 


Ala 


Ser 


Glu 


Ala 


Pro 


Ala 


Ala 


Ala 


Glu 


Asp 


Asn 


Ala 








100 










105 










110 






Glu 


Glu 


Thr 


Pro 


Ala 


Ala 


Ala 


Glu 


Glu 


Asn 


Asn 


Asp 


Glu 


Asn 


Ala 


Ser 






115 










120 










125 








Glu 


Glu 


Val 


Ala 


Glu 


Glu 


Thr 


Pro 


Asp 


Glu 


He 


Lys 


Leu 


Glu 


Thr 


Ala 



130 135 140 

Pro Ala 
145 

(2) INFORMATION FOR SEQ ID NO: 27 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..252 

(D) OTHER INFORMATION: / Ceres Seq. ID 1534546 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274: 



Met 


Ser 


He 


Ser 


Gly 


Ala 


Ala 


Val 


Gly 


Ser 


Gly Arg 


Asn 


Leu 


Arg 


Arg 


1 








5 










10 










15 




Ala 


Val 


Glu 


Phe 
20 


Gly 


Lys 


Thr 


His 


Val 

25 


Val 


Arg 


Pro 


Lys 


Gly 
30 


Lys 


His 


Gin 


Ala 


Thr 

35 


He 


Val 


Trp 


Leu 


His 
40 


Gly 


Leu 


Gly 


Asp 


Asn 
45 


Gly 


Ser 


Ser 


Trp 


Ser 
50 


Gin 


Leu 


Leu 


Glu 


Thr 

55 


Leu 


Pro 


Leu 


Pro 


Asn 
60 


He 


Lys 


Trp 


He 


Cys 


Pro 


Thr 


Ala 


Pro 


Ser 


Gin 


Pro 


He 


Ser 


Leu 


Phe 


Gly 


Gly 


Phe 


Pro 


65 










70 










75 










80 


Ser 


Thr 


Ala 


Trp 


Phe 
85 


Asp 


Val 


Val 


Asp 


He 
90 


Asn 


Glu 


Asp 


Gly 


Pro 
95 


Asp 
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Asp Met Glu Gly Leu Asp Val Ala Ala 
100 105 
Ser Asn Glu Pro Ala Asp lie Lys Leu 
115 120 
5 Gly Ala Ala Thr Ser Leu Tyr Ser Ala 
130 135 
Tyr Gly Asn Gly Asn Pro Tyr Pro lie 
145 " ^ 150 

Leu Ser Gly Trp Leu Pro Cys Ala Lys 
10 165 

Glu Glu Gin lie Lys Asn Arg Ala Ala 
180 185 
His Gly Lys Ala Asp Asp Val Val Pro 
195 200 
15 Ser Gin Ala Leu Leu Ser Asn Gly Phe 
210 215 
Tyr Ser Ala Leu Gly His His Thr lie 
225 230 
Cys Ala Trp Leu Thr Ser Thr Leu Ser 
20 245 

(2) INFORMATION FOR SEQ ID NO: 275: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino aci 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS; 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
30 (B) LOCATION: 1..155 

(D) OTHER INFORMATION: / Ceres Seq. ID 1534547 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 



Met 


Glu 


Gly 


Leu 


Asp 


Val 


Ala 


Ala 


Ala 


His 


Val 


Ala 


Asn 


Leu 


Leu 


Ser 


1 








5 










10 










15 




Asn 


Glu 


Pro 


Ala 


Asp 


He 


Lys 


Leu 


Gly 


Val 


Gly 


Gly 


Phe 


Ser 


Met 


Gly 








20 










25 










30 






Ala 


Ala 


Thr 


Ser 


Leu 


Tyr 


Ser 


Ala 


Thr 


Cys 


Phe 


Ala 


Leu 


Gly 


Lys 


Tyr 






35 










40 










45 








Gly 


Asn 


Gly 


Asn 


Pro 


Tyr 


Pro 


He 


Asn 


Leu 


Ser 


Ala 


He 


lie 


Gly 


Leu 




50 










55 










60 










Ser 


Gly 


Trp 


Leu 


Pro 


Cys 


Ala 


Lys 


Thr 


Leu 


Ala 


Gly 


Lys 


Leu 


Glu 


Glu 


65 










70 










75 










80 


Glu 


Gin 


He 


Lys 


Asn 


Arg 


Ala 


Ala 


Ser 


Leu 


Pro 


He 


Val 


Val 


Cys 


His 










85 










90 










95 




Gly 


Lys 


Ala 


Asp 


Asp 


Val 


Val 


Pro 


Phe 


Lys 


Phe 


Gly 


Glu 


Lys 


Ser 


Ser 








100 










105 










110 






Gin 


Ala 


Leu 


Leu 


Ser 


Asn 


Gly 


Phe 


Lys 


Lys 


Val 


Thr 


Phe 


Lys 


Pro 


Tyr 






115 










120 










125 








Ser 


Ala 


Leu 


Gly 


His 


His 


Thr 


He 


Pro 


Gin 


Glu 


Leu 


Asp 


Glu 


Leu 


Cys 




130 










135 










140 










Ala 


Trp 


Leu 


Thr 


Ser 


Thr 


Leu 


Ser 


Leu 


Glu 


Gly 













145 150 155 

(2) INFORMATION FOR SEQ ID NO:276: 
(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
60 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..1592 



455 

Ala His Val Ala Asn Leu Leu 
110 

Gly Val Gly Gly Phe Ser Met 
125 

Thr Cys Phe Ala Leu Gly Lys 
140 

Asn Leu Ser Ala He He Gly 
155 160 
Thr Leu Ala Gly Lys Leu Glu 
17 0 J 175 

Ser Leu Pro He Val Val Cys 
190 

Phe Lys Phe Gly Glu Lys Ser 
205 

Lys Lys Val Thr Phe Lys Pro 
220 

Pro Gin Glu Leu Asp Glu Leu 
235 240 
Leu Glu Gly 
250 



ds 



WO 00/40695 



PCT/US00/00466 



456 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567172 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 

gtagtctctc tttttataac cacttctcga aaactgaaac ctttgtagag agaacccata 60 

gttcgataaa acattctttt tgcaactgag acttggcaac ttggttttac tcaaagtaag 120 

5 atttcttctt gttgatggtg ataagttcaa tatgatgaaa agtttggcta gtgcggttgg 180 

agggaagacg gcgagggcat gtgatagctg cgtgaagagg cgggcacgtt ggtattgcgc 240 

aGctgatgat gcctttcttt gccatgcttg tgacggttcg gtccactcgg caaaccctct 300 

tgctcgtaga cacgagagag ttcgcttgaa atcggctagc gccggaaaat atcgccatgc 360 

ctcgccgcct caccaagcca cgtggcatca gggatttaca cgtaaagctc ggaccccacg 420 

10 tggaggcaag aagagccaca cgatggtttt tcatgatctt gtgccggaga tgagcacgga 480 

ggatcaagcg gagagttacg aggtggaaga gcagctcata tttgaggtgc cggtgatgaa 540 

ctcgatggtt gaggagcaat gctttaacca atccctggag aaacagaatg agtttccaat 600 

gatgccctta agtttcaaga gtagtgacga agaagatgac gacaacgctg agagttgtct 660 

gaatggtttg ttcccaaccg acatggaact agctcagttc acagctgacg tggagactct 720 

15 actcggtgga ggggatcgag agtttcattc catagaagaa ctagggttag gtgagatgtt 780 

aaagatcgaa aaagaggagg tggaggaaga ggaaggagtt gtgacaagag aagtgcatga 840 

tcaagatgaa ggtgatgaga catccccatt tgaaataagc tttgactacg agtacacaca 900 

caagaccaca ttcgatgaag gagaagaaga tgagaaagaa gacgtgatga agaatgtgat 960 

ggagatggga gtgaatgaga tgagtggtgg gattaaagaa gagaagaagg agaaggctct 1020 

20 tatgcttaga ttggactatg aatcagtcat ttccacttgg ggaggccaag ggatcccatg 1080 

gaccgcccgg gtgccatctg aaatagacct cgacatggtt tgtttcccaa cccataccat 1140 

gggtgaaagt ggagcagagg ctcatcatca caaccacttc cgcggcctaG ggttacacct 1200 

aggagatgct ggggatggag gaagagaggc tagggtttca agataccgag agaaaaggag 1260 

gacaaggttg ttctccaaga agataaggta cgaggtacgt aaattgaatg cagataaaag 1320 

25 gcctcgcatg aaaggaaggt tcgtcaagag atcttcaatt ggtgttgctc actaaagaac 1380 

ttaattaatt atggatatta aattactttg ctctcatctt gcttttttgt tgctatagtt 1440 

ttggtgattg ttagctttct ttttctgcat tcatagagaa ttttgcacgt ttttgtgagc 1500 

tacgtatgta cataaatata tcaccaaaaa atgtgactat cttgtaagca ctgatttata 1560 



tagtcgatat aacgtgaatt ttgattgctg gt 
3 0 (2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 07 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 407 

40 (D) OTHER INFORMATION: / Ceres Seq. ID 1567173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Met 


Met 


Lys 


Ser 


Leu 


Ala 


Ser 


Ala 


Val 


Gly 


Gly 


Lys 


Thr 


Ala 


Arg 


Ala 


1 








5 










10 










15 




Cys 


Asp 


Ser 


Cys 


Val 


Lys 


Arg 


Arg 


Ala 


Arg 


Trp 


Tyr 


Cys 


Ala 


Ala 


Asp 








20 










25 










30 






Asp 


Ala 


Phe 


Leu 


Cys 


His 


Ala 


Cys 


Asp 


Gly 


Ser 


Val 


His 


Ser 


Ala 


Asn 






35 










40 










45 








Pro 


Leu 


Ala 


Arg 


Arg 


His 


Glu 


Arg 


Val 


Arg 


Leu 


Lys 


Ser 


Ala 


Ser 


Ala 




50 










55 










60 










Gly 


Lys 


Tyr 


Arg 


His 


Ala 


Ser 


Pro 


Pro 


His 


Gin 


Ala 


Thr 


Trp 


His 


Gin 


65 










70 










75 










80 


Gly 


Phe 


Thr 


Arg 


Lys 


Ala 


Arg 


Thr 


Pro 


Arg 


Gly 


Gly 


Lys 


Lys 


Ser 


His 










85 










90 










95 




Thr 


Met 


Val 


Phe 


His 


Asp 


Leu 


Val 


Pro 


Glu 


Met 


Ser 


Thr 


Glu 


Asp 


Gin 








100 










105 










110 






Ala 


Glu 


Ser 


Tyr 


Glu 


Val 


Glu 


Glu 


Gin 


Leu 


lie 


Phe 


Glu 


Val 


Pro 


Val 






115 










120 










125 








Met 


Asn 


Ser 


Met 


Val 


Glu 


Glu 


Gin 


Cys 


Phe 


Asn 


Gin 


Ser 


Leu 


Glu 


Lys 



130 135 140 



60 Gin Asn Glu Phe Pro Met Met Pro Leu Ser Phe Lys Ser Ser Asp Glu 
145 150 155 160 

Glu Asp Asp Asp Asn Ala Glu Ser Cys Leu Asn Gly Leu Phe Pro Thr 
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165 170 175 

Asp Met Glu Leu Ala Gin Phe Thr Ala Asp Val Glu Thr Leu Leu Gly 

180 185 190 

Gly Gly Asp Arg Glu Phe His Ser lie Glu Glu Leu Gly Leu Gly Glu 
5 195 200 205 

Met Leu Lys lie Glu Lys Glu Glu Val Glu Glu Glu Glu Gly Val Val 

210 215 220 

Thr Arg Glu Val His Asp Gin Asp Glu Gly Asp Glu Thr Ser Pro Phe 
225 230 235 240 

10 Glu lie Ser Phe Asp Tyr Glu Tyr Thr His Lys Thr Thr Phe Asp Glu 

245 250 255 

Gly Glu Glu Asp Glu Lys Glu Asp Val Met Lys Asn Val Met Glu Met 

260 265 270 

Gly Val Asn Glu Met Ser Gly Gly lie Lys Glu Glu Lys Lys Glu Lys 
15 275 280 285 

Ala Leu Met Leu A.rg Leu Asp Tyr Glu Ser Val lie Ser Thr Trp Gly 

290 295 300 

Gly Gin Gly lie Pro Trp Thr Ala Arg Val Pro Ser Glu lie Asp Leu 
305 310 315 320 

2 0 Asp Met Val Cys Phe Pro Thr His Thr Met Gly Glu Ser Gly Ala Glu 

325 330 335 

Ala His His His Asn His Phe Arg Gly Leu Gly Leu His Leu Gly Asp 

340 345 350 

Ala Gly Asp Gly Gly Arg Glu Ala Arg Val Ser Arg Tyr Arg Glu Lys 
2 5 355 360 365 

Arg Arg Thr Arg Leu Phe Ser Lys Lys lie Arg Tyr Glu Val Arg Lys 

370 375 380 

Leu Asn Ala Asp Lys Arg Pro Arg Met Lys Gly Arg Phe Val Lys Arg 
385 390 395 400 

30 Ser Ser lie Gly Val Ala His 

405 

(2) INFORMATION FOR SEQ ID NO: 27 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 6 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 0 (A) NAME/KEY: peptide 

(B) LOCATION: 1. . 4 06 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567174 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 
Met Lys Ser Leu Ala Ser Ala Val Gly Gly Lys Thr Ala Arg Ala Cys 
45 1 5 10 15 

Asp Ser Cys Val Lys Arg Arg Ala Arg Trp Tyr Cys Ala Ala Asp Asp 

20 25 30 

Ala Phe Leu Cys His Ala Cys Asp Gly Ser Val His Ser Ala Asn Pro 
35 40 45 

50 Leu Ala Arg Arg His Glu Arg Val Arg Leu Lys Ser Ala Ser Ala Gly 
50 55 60 

Lys Tyr Arg His Ala Ser Pro Pro His Gin Ala Thr Trp His Gin Gly 
65 70 75 80 

Phe Thr Arg Lys Ala Arg Thr Pro Arg Gly Gly Lys Lys Ser His Thr 
55 85 90 95 

Met Val Phe His Asp Leu Val Pro Glu Met Ser Thr Glu Asp Gin Ala 

100 105 110 

Glu Ser Tyr Glu Val Glu Glu Gin Leu lie Phe Glu Val Pro Val Met 
115 120 125 

60 Asn Ser Met Val Glu Glu Gin Cys Phe Asn Gin Ser Leu Glu Lys Gin 
130 135 140 

Asn Glu Phe Pro Met Met Pro Leu Ser Phe Lys Ser Ser Asp Glu Glu 
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145 










150 










155 










160 




Asp 


Asp 


Asp 


Asn 


Ala 


Glu 


Ser 


Cys 


Leu 


Asn 


Gly 


Leu 


Phe 


Pro 


Thr 


Asp 












1 65 










i / u 










1 / 3 






Met 


Glu 


Leu 


Ala 


Gin 


Phe 


Thr 


Ala 


Asp 


Val 


Glu 


Thr 


Leu 


Leu 


Gly 


Gly 


c 
O 








18 0 










IOC 

loo 










i y u 








Gly 


Asp 


Arg 


Glu 


Phe 


His 


Ser 


He 


Glu 


Glu 


Leu 


Gly 


Leu 


Gly 


Glu 


Met 








195 










2 0 0 










o r\ c 










Leu 


Lys 


He 


Glu 


Lys 


Glu 


Glu 


Val 


Glu 


Glu 


Glu 


Glu 


Gly 


Val 


Val 


Thr 






210 










215 










22 0 










1 u 


Arg 


Glu 


Val 


His 


Asp 


Gin 


Asp 


Glu 


Gly 


Asp 


Glu 


Thr 


Ser 


Pro 


Phe 


Glu 




225 










230 










235 










24 0 




lie 


Ser 


Phe 


Asp 


Tyr 


Glu 


Tyr 


Thr 


His 


Lys 


Thr 


Thr 


Phe 


Asp 


Glu 


Gly 












24 5 










250 










255 






Glu 


Glu 


Asp 


Glu 


Lys 


Glu 


Asp 


Val 


Met 


Lys 


Asn 


Val 


Met 


Glu 


Met 


Gly 


1 IT 

15 








260 










265 










270 








Val 


Asn 


Glu 


Met 


Ser 


Gly 


Gly 


He 


Lys 


Glu 


Glu 


Lys 


Lys 


Glu 


Lys 


Ala 








275 










280 










285 










Leu 


Met 


Leu 


Arg 


Leu 


Asp 


Tyr 


Glu 


Ser 


Val 


He 


Ser 


Thr 


Trp 


Gly 


Gly 






2 90 










2 95 










o n a 

30 0 










z(J 


Gin 


Gly 


He 


Pro 


Trp 


Thr 


Ala 


Arg 


Val 


Pro 


Ser 


Glu 


He 


Asp 


Leu 


Asp 




305 










310 










315 










320 




Met 


Val 


Cys 


Phe 


Pro 


Thr 


His 


Thr 


Met 


Gly 


Glu 


Ser 


Gly 


Ala 


Glu 


Ala 












325 










330 










335 






His 


His 


His 


Asn 


His 


Phe 


Arg 


Gly 


"Leu 


Gly 


Leu 


His 


Leu 


Gly 


Asp 


Ala 


z o 








34 0 










34 5 










350 








Gly 


Asp 


Gly 


Gly 


Arg 


Glu 


Ala 


A.rg 


Val 


Ser 


Arg 


Tyr 


Arg 


Glu 


Lys 


Arg 








355 










360 










365 










Arg 


Thr 


Arg 


Leu 


Phe 


Ser 


Lys 


Lys 


He 


Arg 


Tyr 


Glu 


Val 


Arg 


Lys 


Leu 






370 










375 










380 










30 


Asn 


Ala 


Asp 


Lys 


Arg 


Pro 


Arg 


Met 


Lys 


Gly Arg 


Phe 


Val 


Lys 


Arg 


Ser 




385 










390 










395 










400 




Ser 


He 


Gly 


Val 


Ala 


His 























405 



(2) INFORMATION FOR SEQ ID NO: 27 9: 
35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
4 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..310 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567175 
4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 9: 



Met 


Val 


Phe 


His 


Asp 


Leu 


Val 


Pro 


Glu 


Met 


Ser 


Thr 


Glu 


Asp 


Gin 


Ala 


1 








5 










10 










15 




Glu 


Ser 


Tyr 


Glu 


Val 


Glu 


Glu 


Gin 


Leu 


He 


Phe 


Glu 


Val 


Pro 


Val 


Met 








20 










25 










30 






Asn 


Ser 


Met 


Val 


Glu 


Glu 


Gin 


Cys 


Phe 


Asn 


Gin 


Ser 


Leu 


Glu 


Lys 


Gin 






35 










40 










45 








Asn 


Glu 


Phe 


Pro 


Met 


Met 


Pro 


Leu 


Ser 


Phe 


Lys 


Ser 


Ser 


Asp 


Glu 


Glu 




50 










55 










60 










Asp 


Asp 


Asp 


Asn 


Ala 


Glu 


Ser 


Cys 


Leu 


Asn 


Gly 


Leu 


Phe 


Pro 


Thr 


Asp 


65 










70 










75 










80 


Met 


Glu 


Leu 


Ala 


Gin 


Phe 


Thr 


Ala 


Asp 


Val 


Glu 


Thr 


Leu 


Leu 


Gly 


Gly 










85 










90 










95 




Gly 


Asp 


Arg 


Glu 


Phe 


His 


Ser 


He 


Glu 


Glu 


Leu 


Gly 


Leu 


Gly 


Glu 


Met 








100 










105 










110 






Leu 


Lys 


He 


Glu 


Lys 


Glu 


Glu 


Val 


Glu 


Glu 


Glu 


Glu 


Gly 


Val 


Val 


Thr 






115 










120 










125 








Arg 


Glu 


Val 


His 


Asp 


Gin 


Asp 


Glu 


Gly 


Asp 


Glu 


Thr 


Ser 


Pro 


Phe 


Glu 
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130 










135 










140 










lie 


Qo r 
OC r 


Pho 
rr lie 


Aon 
H5p 


i yr 


f 1 n 

LjX Li 


i yr 


1 1 1 X 


U-| O 

n x o 


L y s 


1 1 IX 


Thr 
1 1 IX 


Pho 


Tx en 
no £J 


fin 
KjJ- Li 


f l \/ 
ljx y 


145 










150 










155 










160 


f 1 n 
LjX Li 


f 1 n 
Lj X Li 


Asp 


fin 
kjr X Li 


Lys 

IDC) 


f i n 

LjX Li 




Val 


L. 


Lys 

X / u 


rA.o 1 1 


V a X 


rJc L. 


fin 
LjX Li 


Mot- 
ive l. 

1 7 S 

X / D 


lj x y 


V ci _L 


7\ c r^i 


fin 


L v lfci L 

180 


o e x 


f 1 \/ 

ljx y 




-Tip 
X X C 


Lys 
185 


fin 


fin 
UlU 


Lys 


T 

Lys 


fin 
\j X Li 

190 


Lys 


A 1 a 
M.X ci 


Leu 


1X1 e L. 


Leu 
195 


7\ r /-t 
M.X Lj 


Leu 




i yr 


f 1 n 
oiu 

200 


Oci 


XT-. I 
V a. X 


T 1 
lie 


Q o r 


Th r 

i nr 
205 


T m 

i rp 


Kj± y 


f 1 \r 

biy 


(j±n 


Ljx y 
210 


Tin 

i x e 


Pro 


Trp 


T H r 

i nr 


7\ I o 
/AX d 

215 


Ar g 


v a x 


Pro 


oer 


fin 
blU 

220 


T 1 c± 

x xe 


Asp 


Leu 


Asp 


ixie u 


Vdl 


Cys 


irne 


Pro 


i nr 


nlS 


i nr 


lyie l. 


P 1 w 

x y 


fin 


Ser 


f 1 w 

LjX y 


7\ 1 2 

ifAi a 


fin 

LjIU 




225 










230 










235 










240 


rllS 


nlS 


nlS 


As n 


ril S 


ir ne 


Arg 




Leu 


250 


Leu 


nlS 


Leu 


b ry 


Asp 

? s s 


A_i_a 


Gly 


Asp 


Gly 


Gly 
260 


Arg 


Glu 


Ala 


Arg 


Val 
265 


Ser 


Arg 


Tyr 


Arg 


Glu 

270 


Lys 


Arg 


Arg 


Thr 


Arg 

275 


Leu 


Phe 


Ser 


Lys 


Lys 
280 


He 


Arg 


Tyr 


Glu 


Val 
285 


Arg 


Lys 


Leu 


Asn 


Ala 
290 


Asp 


Lys 


Arg 


Pro 


Arg 
295 


Met 


Lys 


Gly 


Arg 


Phe 
300 


Val 


Lys 


Arg 


Ser 


Ser 


He 


Gly 


Val 


Ala 


His 























305 310 
(2) INFORMATION FOR SEQ ID NO: 280: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 520 

(D) OTHER INFORMATION: / Ceres Seq. ID 15675o5 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 



aaaaattccg cagaattgat tategtcatt tcgttaagga tctctgtggg gtttgatcga 60 

atttcatagc gaaggagaga gtgagagaga gattgaagca gtaatggcag gaacatctgg 120 

attgetcaac gcagtgaagc caaagatcca aacgattgat attcaggecg ccgctggatg 180 

gggaatcgee gccgcagccg gtgccatctg ggtcgtccaa ccatttggtt ggataaagaa 240 

gacattcatt gacccacccc caactgaaga gaagtgagat atcaccaaga ttcaagacta 300 

agtgaagagt tttccagttt ctaagtttgt taccgcctct ttggcgtgga gctgaataaa 360 

ttttggttct tttctggggc tttctggttt catttcttca atgtttgact gatttgtctt 420 

ctcaaatgtt ataacaaaca agaaaagcac ttaccccaag ttacttgaaa agagaaccca 480 



ttttgcattt atcatgagtt attatttTcc atgtcaaagt 
(2) INFORMATION FOR SEQ ID NO: 281: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1.-57 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567536 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281: 
Met Ala Gly Thr Ser Gly Leu Leu Asn Ala Val Lys Pro Lys lie Gin 
15 10 15 

Thr lie Asp lie Gin Ala Ala Ala Gly Trp Gly lie Ala Ala Ala Ala 

20 25 30 

Gly Ala lie Trp Val Val Gin Pro Phe Gly Trp lie Lys Lys Thr Phe 
"35 4 0 4 5 
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aaatctctaa 


agtttcaatt 


60 


atttctgcaa 


ctttgccttc 


120 


attcaaaaac 


tcccattttc 


180 


act cgaaacc 


ctagcagcat 


240 


tcgactgatc 


tcgacgttgt 


300 


ttaggtctta 


ttggtttggg 


360 


catcacggca 


attgacaaat 


420 


gtt ct ccacg 


tggttcacat 


480 


gaaaattgt c 


aagaaat cag 


540 


atacTttcat 


ctttggaaga 


600 


ttgttgtttc 


tagt ccaata 


660 


ttttggttgt 


gagcaaaatc 


720 


gtttt 







460 

He Asp Pro Pro Pro Thr Glu Glu Lys 

50 55 
(2) INFORMATION FOR SEQ ID NO: 282: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

10 (ix) FEATURE: 

. (A) NAME /KEY : - 

(B) LOCATION: 1 . .765 

(D) OTHER INFORMATION: / Ceres Seq. ID 1569689 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 
15 aaccaaaaat ttcttcgtct ctgtctccaa aatcgaatca 

tttttgttct gttctttttt ttttttaaag aatggcttca 

gccattgtta ctcacacaga gaaaatccaa tctcacatcg 

tctaactcga ggtacgaatg atctttctcc attatctctt 

cagtctgatg gtgaaagcta gtggagaaag ctcagattca 
20 tagtacgatt cagaatgttt gggataagtc tgaagatagg 

ttttRgctgg tattgtagct ctttgggcat cattgaatct 

tgsccgttat ctcgagcgga ttcgaactag trggtatctt 

atcgatatct cttgttcaaa ccggacagac aggagctttc 

tagcggatat acttggccag tgaaccttgt gtgtgtgata 
25 tgatttgttt gcaagtttgt aaaattacat gacagggtgg 

atgtcatgca tttgaaacct gtaaatactt tattgttggt 

aatcttttct aatttcaaag attctctttt atgattatac 

(2) INFORMATION FOR SEQ ID NO: 283: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

35 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 66 

(D) OTHER INFORMATION: / Ceres Seq. ID 1569690 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283~: 
4 0 Asn Gin Lys Phe Leu Arg Leu Cys Leu Gin Asn Arg lie Lys lie Ser 
1 5 "10 15 

Lys Val Ser He Phe Leu Phe Cys Ser Phe Phe Phe Leu Lys Asn Gly 

20 25 30 

Phe Asn Phe Cys Asn Phe Ala Phe Ala He Val Thr His Thr Glu Lys 
45 35 ^ 40 45 

He Gin Ser His He Asp Ser Lys Thr Pro He Phe Ser Asn Ser Arg 

50 55 60 

Tyr Glu 
65 

50 (2) INFORMATION FOR SEQ ID NO: 28 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..107 

60 (D) OTHER INFORMATION: / Ceres Seq. ID 1569691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 
Met Ala Ser He Ser Ala Thr Leu Pro Ser Pro Leu Leu Leu Thr Gin 
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1 








5 










10 










15 




Arg 


Lys 


Ser 


Asn 


Leu 


Thr 


Ser 


He 


Gin 


Lys 


Leu 


Pro 


Phe 


Ser 


Leu 


Thr 








20 










25 










30 






Arg 


Gly 


Thr 


Asn 


Asp 


Leu 


Ser 


Pro 


Leu 


Ser 


Leu 


Thr 


Arg 


Asn 


Pro 


Ser 






35 










40 










45 








Ser 


He 


Ser 


Leu 


Met 


Val 


Lys 


Ala 


Ser 


Gly 


Glu 


Ser 


Ser 


Asp 


Ser 


Ser 




50 










55 










60 










Thr 


Asp 


Leu 


Asp 


Val 


Val 


Ser 


Thr 


He 


Gin 


Asn 


Val 


Trp 


Asp 


Lys 


Ser 


65 










70 










75 










80 


Glu 


Asp 


Arg 


Leu 


Gly 


Leu 


He 


Gly 


Leu 


Gly 


Phe 


Xaa 


Trp 


Tyr 


Cys 


Ser 










85 










90 










95 




Ser 


Leu 


Gly 


He 


He 


Glu 


Ser 


His 


His 


Gly Asn 













10 

Ser Leu Gly 

100 105 
(2) INFORMATION FOR SEQ ID NO: 285: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
2 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .55 

(D) OTHER INFORMATION: / Ceres Seq. ID 1569692 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:285: 

Met Val Lys Ala Ser Gly Glu Ser Ser Asp Ser Ser Thr Asp Leu Asp 
1 5 10 " 15 

Val Val Ser Thr He Gin Asn Val Trp Asp Lys Ser Glu Asp Arg Leu 
20 25 30 

30 Gly Leu He Gly Leu Gly Phe Xaa Trp Tyr Cys Ser Ser Leu Gly He 
35 40 45 

He Glu Ser His His Gly Asn 

50 55 
(2) INFORMATION FOR SEQ ID NO: 28 6: 
35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 base pairs 
■(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 0 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1 . .78 2 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571042 

4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

atcaaccaaa acaaaacata aaaaacaagt ggaagcttta aaacgagagg gagagagcaa 60 
aaatggcgac gtcgggaacg tacgtgacgg aagttccgct aaaaggatcg gccgagaaat 120 
actacaagag gtggaagaac gagaaccatg tcttccctga tgctatcggc caccacatcc 180 
aaaatgttac cgttcacgaa ggcgaacatg actctcacgg Gtctatcagg agttggaact 240 

50 acacatggga tggaaaggag gaggtgttca aggagagaag agagatagac gatgagacca 300 
aaacgttgac gttaagagga cttgagggtc acgtgatgga gcagctcaaa gtgtacgacg 360 
tcgtctacca attcattccc aaatctgagg atacctgcat cggcaaaatc actttaatat 420 
gggagaagcg caacgatgat tccccagaac caagcggcta catgaaattc gtcaagagct 480 
tggttgctga catgggaaac cacgttagca aaacttaatc atcattccca cagtcgtcgt 540 

55 catcatcatc atcatcatca tcatcatcat catcatcatc atcatcatca tcatcatcat 600 
catcatcatc atcatcacta tctcgattta taagttaaga tgttttcagt ataataaatg 660 
gggtcttgtg gatcgttcat ttctatgtgt aaaccgtttg gttctgtatg atgcttcgat 720 
atattgttat gttcatgatc atatgtcggg ttcgatataa tgattcttaa gattaattta 780 
ct 

60 (2} INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 151 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571043 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
10 Met Ala Thr Ser Gly Thr Tyr Val Thr Glu Val Pro Leu Lys Gly Ser 
1 5 10 15 

Ala Glu Lys Tyr Tyr Lys Arg Trp Lys Asn Glu Asn His Val Phe Pro 

20 25 30 

Asp Ala lie Gly His His He Gin Asn Val Thr Val His Glu Gly Glu 
15 35 40 45 

His Asp Ser His Gly Ser He Arg Ser Trp Asn Tyr Thr Trp Asp Gly 

50 55 60 

Lys Glu Glu Val Phe Lys Glu Arg Arg Glu He Asp Asp Glu Thr Lys 
65 70 75 80 

2 0 Thr Leu Thr Leu Arg Gly Leu Glu Gly His Val Met Glu Gin Leu Lys 

85 90 95 

Val Tyr Asp Val Val Tyr Gin Phe lie Pro Lys Ser Glu Asp Thr Cys 

100 105 110 

He Gly Lys He Thr Leu He Trp Glu Lys Arg Asn Asp Asp Ser Pro 
25 115 120 125 

Glu Pro Ser Gly Tyr Met Lys Phe Val Lys Ser Leu Val Ala Asp Met 

130 135 140 

Gly Asn His Val Ser Lys Thr 
145 150 
30 (2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .718 

40 (D) OTHER INFORMATION: / Ceres Seq. ID 1571079 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
attcaaagac atacaaaata attgagtttt ttttttttaa ttagaaacaa atggggttga 60 
gtggtgttct tcatgtggag gttgaggtta agtctccggc tgaaaagttc tgggtagccc 120 
tcggcgacgg catcaatctc ttccccaaag ctttccctaa cgactacaaa accatccaag 180 

45 ttctagccgg cgacggcaac gctcctggct ccattcgcct cattacttat ggagaaggat 240 
ctccactggt gaagatatcg gcggagagga tcgaagcagt ggatttggag aacaaaagca 300 
tgtcatacag catcattggc ggcgaaatgt tggagtacta caaaacgttc aaaggaacca 360 
tcaccgttat tcctaagaac ggtggcagcc ttctgaaatg gtctggtgag tttgagaaga 420 
ccgcccatga gattgatgac ccacacgtca tcaaggactt tgctgtcaag aacttcaaag 480 

50 agatagatga gtatctcctt aagcaaacta gtgcctaaca ctagaacctt taaattatat 540 
Maagagggtt cgatcgtctc tataagattt ttctaattaa gaagttgaat aaagtggaac 600 
ctctttatga atatccaagt ttgtgatttc ggagtttatg cagcctagta ggccataagc 660 
tttttacaaa agccaattta gtcgaacatt ttgaaaaaat cgaacctttt ggtaaagc 
(2) INFORMATION FOR SEQ ID NO:289: 

5 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
60 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 
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(B) LOCATION: 1..155 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571080 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:289: 
Met Gly Leu Ser Gly Val Leu His Val Glu Val Glu Val Lys Ser Pro 
5 1 5 10 15 

Ala Glu Lys Phe Trp Val Ala Leu Gly Asp Gly lie Asn Leu Phe Pro 

20 25 30 

Lys Ala Phe Pro Asn Asp Tyr Lys Thr lie Gin Val Leu Ala Gly Asp 
35 40 45 

10 Gly Asn Ala Pro Gly Ser lie Arg Leu lie Thr Tyr Gly Glu Gly Ser 
50 55 60 

Pro Leu Val Lys lie Ser Ala Glu Arg lie Glu Ala Val Asp Leu Glu 
65 70 75 80 

Asn Lys Ser Met Ser Tyr Ser lie lie Gly Gly Glu Met Leu Glu Tyr 
15 " 85 90" 95 

Tyr Lys Thr Phe Lys Gly Thr lie Thr Val lie Pro Lys Asn Gly Gly 

100 105 110 

Ser Leu Leu Lys Trp Ser Gly Glu Phe Glu Lys Thr Ala His Glu lie 
115 120 125 

2 0 Asp Asp Pro His Val lie Lys Asp Phe Ala Val Lys Asn Phe Lys Glu 
130 135 140 

lie Asp Glu Tyr Leu Leu Lys Gin Thr Ser Ala 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 2 90: 
2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..808 

(D) OTHER INFORMATION: / Ceres Seq. ID 1572097 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 290: 

atcaaccaaa acaaaacata aaaaacaagt ggaagcttta aaacgagagg gatagagcaa 60 
aaatggcgac gtcgggaacg tacgtgacgg aagttccgct aaaaggatcg gccgagaaat 120 
actacaagag gtggaagaac gagaaccatg tcttccctga tgctatcggc caccacatcc 180 
aaaatgttac cgttcacgaa ggcgaacatg actctcacgg gtctatcagg agttggaact 240 

4 0 acacatggga tggaaaggag gaggtgttca aggagagaag agagatagac gatgagacca 300 
aaacgttgac gttaagagga cttgagggtc acgtgatgga gcagctcaaa gtgtacgacg 360 
tcgtctacca attcattccc aaatctgagg atacctgcat cggcaaaatc actttaatat 420 
gggagaagcg caacgatgat tccccagaac caagcggcta catgaaattc gtcaagagct 480 
tggttgctga catgggaaac cacgttagCC CcAAAAAAAA AAAAAAaaaa cttaatcatc 540 

4 5 attcccacag tcgtcgtcat catcatcatc atcatcatca tcatcatcat catcatcatc 600 
atcatcatca tcatcatcat catcatcatc atcactatct cgatttataa gttaagatgt 660 
tttcagtata ataaatgggg tcttgtggat cgttcatttc tatgtgtaaa ccgtttggtt 720 
ctgtatgatg cttcgatata ttgttatgtt catgarcata tgtcgggttc gatataatga 780 
ttcttaagat taatttacta cacatttc 

50 (2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..195 

60 (D) OTHER INFORMATION: / Ceres Seq. ID 1572098 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 
Met Ala Thr Ser Gly Thr Tyr Val Thr Glu Val Pro Leu Lys Gly Ser 
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15 10 15 

Ala Glu Lys Tyr Tyr Lys Arg Trp Lys Asn Glu Asn His Val Phe Pro 

20 " 25 30 

Asp Ala lie Gly His His lie Gin Asn Val Thr Val His Glu Gly Glu 
5 35 40 45 

His Asp Ser His Gly Ser lie Arg Ser Trp Asn Tyr Thr Trp Asp Gly 

50 55 60 

Lys Glu Glu Val Phe Lys Glu Arg Arg Glu lie Asp Asp Glu Thr Lys 
65 70 75 80 

10 Thr Leu Thr Leu Arg Gly Leu Glu Gly His Val Met Glu Gin Leu Lys 

85 " 90 95 

Val Tyr Asp Val Val Tyr Gin Phe lie Pro Lys Ser Glu Asp Thr Cys 

100 " 105 110 

lie Gly Lys lie Thr Leu lie Trp Glu Lys Arg Asn Asp Asp Ser Pro 
15 115 120 125 

Glu Pro Ser Gly Tyr Met Lys Phe Val Lys Ser Leu Val Ala Asp Met 

130 135 140 

Gly Asn His Val Ser Pro Lys Lys Lys Lys Lys Lys Leu Asn His His 
145 150 " ' ^ 155 160 

20 Ser His Ser Arg Arg His His His His His His His His His His His 

165 170 175 

His His His His His His His His His His His His His His His Tyr 
180 185 190 

Leu Asp Leu 
25 195 

(2) INFORMATION FOR SEQ ID NO: 292: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
30 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

3 5 (B) LOCATION: 1..10 4 

(D) OTHER INFORMATION: / Ceres Seq. ID 1572099 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 
Met Glu Gin Leu Lys Val Tyr Asp Val Val Tyr Gin Phe lie Pro Lys 
15 10 15 

4 0 Ser Glu Asp Thr Cys lie Gly Lys lie Thr Leu lie Trp Glu Lys Arg 

20 25 30 

Asn Asp Asp Ser Pro Glu Pro Ser Gly Tyr Met Lys Phe Val Lys Ser 

35 4 0 4 5 

Leu Val Ala Asp Met Gly Asn His Val Ser Pro Lys Lys Lys Lys Lys 
45 50 55 60 

Lys Leu Asn His His Ser His Ser Arg Arg His His His His His His 
65 70 ~ 75 80 

His His His His His His His His His His His His His His His His 
85 90 95 

50 His His His His Tyr Leu Asp Leu 

100 

(2) INFORMATION FOR SEQ ID NO: 293: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 583 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
60 (A) NAME /KEY : - 

(B) LOCATION: 1 . .583 

(D) OTHER INFORMATION: / Ceres Seq. ID 1572890 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 
aaaaacaaaa ataaaaaaaa catcgcacaa gaaaataaaa gatttgtaga atcaactaag 
aaaatggcta gcactatgat gactacattg cctcagttca atggrcttcg agccaccaaa 
atctctgcag ctcctgtaca aggcctggca agtgttcagc ccatgagacg caagggaaat 
ggagctttgg gtgcaaagtg tgacttcatc ggttcatcaa caaatctgat aatggtaacg 
tcgacgaccc tgattttgtt cgcggggaga ttcggacttg cgccatcagc caataggaag 
gcgacagctg gacttaggtt ggaggcacgt gactcaggtc tacaaacggg tgacccggcc 
gggttcacgc ttgcggacac tttggcttgt ggcaccgttg gtcatatcat cggtgtagga 
gttgtccttg gccttaaaaa cattggtgct atttgaagtt cctaaagctc ttttatttgt 
atttgtaaaa Atttgtagat ttttataaca atattctcat gcacctgaac gagatctaat 
ggattttaca agtctttatg tttatcttat aatgttgtat cgc 
(2) INFORMATION FOR SEQ ID NO: 2 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS : 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



ID 1572891 



peptide 
1 . . 151 

;D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 
Lys Asn Lys Asn Lvs Lys Asn lie Ala Gin Glu Asn Lys Arg Phe Val 
1 5* 10 15 

Glu Ser Thr Lys Lys Met Ala Ser Thr Met Met Thr Thr Leu Pro Gin 

20 25 30 

Phe Asn Gly Leu Arg Ala Thr Lys lie Ser Ala Ala Pro Val Gin Gly 

35 4 0 4 5 

Leu Ala Ser Val Gin Pro Met Arg Arg Lys Gly Asn Gly Ala Leu Gly 

50 55 60 

Ala Lys Cys Asp Phe lie Gly Ser Ser Thr Asn Leu lie Met Val Thr 
65 ' 70 75 80 

Ser Thr Thr Leu lie Leu Phe Ala Gly Arg Phe Gly Leu Ala Pro Ser 

85 90 95 

Ala Asn Arg Lys Ala Thr Ala Gly Leu Arg Leu Glu Ala Arg Asp Ser 

100 105 110 

Gly Leu Gin Thr Gly Asp Pro Ala Gly Phe Thr Leu Ala A.sp Thr Leu 

115 120 125 

Ala Cys Gly Thr Val Gly His lie lie Gly Val Gly Val Val Leu Gly 

130 135 140 

Leu Lys Asn lie Gly Ala lie 
145 150 
(2) INFORMATION FOR SEQ ID NO: 2 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 130 

(D) OTHER INFORMATION: / Ceres Seq. ID 1572892 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295: 
Met Ala Ser Thr Met Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg 
15 10 15 

Ala Thr Lys lie Ser Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin 

20 25 30 

Pro Met. Arg Arg Lys Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe 

35~ 40 45 

lie Gly Ser Ser Thr Asn Leu lie Met Val Thr Ser Thr Thr Leu lie 



60 
120 
180 
240 
300 
360 
420 
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50 55 60 

Leu Phe Ala Gly Arg Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala 
65 ' 70 75 80 

Thr Ala Gly Leu Arg Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly 

85 90 95 

Asp Pro Ala Gly Phe Thr Leu Ala Asp Thr Leu Ala Cys Gly Thr Val 

100 105 110 

Gly His lie lie Gly Val Gly Val Val Leu Gly Leu Lys Asn lie Gly 
115 ^ 120 125 

Ala He 
130 

(2) INFORMATION FOR SEQ ID NO:296: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. . 126 

{ D ) OTHER INFORMATION: / Ceres Seq. ID 1572893 
(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 2 9 6 : 
Met Met Thr Thr Leu Pro Gin Phe Asn Gly Leu Arg Ala Thr Lys He 
15 10 15 

Ser Ala Ala Pro Val Gin Gly Leu Ala Ser Val Gin Pro Met Arg Arg 

20 25 30 

Lys Gly Asn Gly Ala Leu Gly Ala Lys Cys Asp Phe He Gly Ser Ser 

" 3 5 4 0 4 5 

Thr Asn Leu He Met Val Thr Ser Thr Thr Leu He Leu Phe Ala Gly 

50 55 60 

Arg Phe Gly Leu Ala Pro Ser Ala Asn Arg Lys Ala Thr Ala Gly Leu 
65 7 0 7 5 '80 

Arg Leu Glu Ala Arg Asp Ser Gly Leu Gin Thr Gly Asp Pro Ala Gly 

85 90 95 

Phe Thr Leu Ala Asp Thr Leu Ala Cys Gly Thr Val Gly His He He 

100 105 110 

Gly Val Gly Val Val Leu Gly Leu Lys Asn He Gly Ala He 

115 120 125 

(2) INFORMATION FOR SEQ ID NO: 297: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..7 48 

(D) OTHER INFORMATION: / Ceres Seq. ID 1573606 
SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
ccttctcact tcaaattcat aatctctcta cttctctctc tctctaaatt caaaaatggc 
gaaactggtg atgttgttgg ttctttgtat cttaccggcg ataGccatgg cggcaaggag 
gagtaatatt ggaaagaata caatggtggt tcaaggtagc acttactgtg acatttgcaa 
attcggcttc gagactcctg aatcctccta cttcatcccc ggtgcaacgg tgaagctatc 
atgcaaagac aggaagacaa tggaagaggt ttacacagac aaagctgtat cggacaaaga 
aggaaagtat aagttcattg tccacgacga tcacagagac cagatgtgcg atgttttgct 
tgtgaaaagc tcggataaaa cctgctctaa aatctccgtt ggacgtgaga agtctcgtgt 
gatcttgaac cattacagtg gcattgcctc gcagatcaga catgctaaca acatgggatt 
cgagaaagaa gtgagtgatg tgttctgctc tgctttgttt cagaagtata tggttgatga 
agatgaggat gatattaaaa accatctcta atctctctgt ttaatcttat gatctgctgt 
tttcttcatt aatgagtttc gagttatgga agagatatat ttgtatttgt ttgattactt 



(ii) 
(ix) 



(xi) 



60 
120 
180 
240 
300 
360 
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540 
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atttgttgtc tttagagatg ttgactctgg tgatcggata actatctgtt tgtgtaagct 720 
tcttatatat tgatgtgtca tttccttg 
(2) INFORMATION FOR SEQ ID NO: 298: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..171 

(D) OTHER INFORMATION: / Ceres Seq. ID 1573607 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 
15 Met Ala Lys Leu Val Met Leu Leu Val Leu Cys lie Leu Pro Ala lie 
15 10 15 

Ala Met Ala Ala Arg Arg Ser Asn lie Gly Lys Asn Thr Met Val Val 

20 25 30 

Gin Gly Ser Thr Tyr Cys Asp lie Cys Lys Phe Gly Phe Glu Thr Pro 
20 " 35 40 45 

Glu Ser Ser Tyr Phe lie Pro Gly Ala Thr Val Lys Leu Ser Cys Lys 

5 0 5 5 60 

Asp Arg Lys Thr Met Glu Glu Val Tyr Thr Asp Lys Ala Val Ser Asp 
65 " 70 75 80 

25 Lys Glu Gly Lys Tyr Lys Phe lie Val His Asp Asp His Arg Asp Gin 

85 90 95 

Met Cys Asp Val Leu Leu Val Lys Ser Ser A.sp Lys Thr Cys Ser Lys 

10 0 105 110 

lie Ser Val Gly Arg Glu Lys Ser Arg Val He Leu Asn His Tyr Ser 
30 115 120 125 

Gly He Ala Ser Gin He Arg His Ala Asn Asn Met Gly Phe Glu Lys 

130 135 140 

Glu Val Ser Asp Val Phe Cys Ser Ala Leu Phe Gin Lys Tyr Met Val 
145 150 155 160 

35 Asp Glu Asp Glu Asp Asp He Lys Asn His Leu 

165 170 
(2) INFORMATION FOR SEQ ID NO: 299: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 166 

(D) OTHER INFORMATION: / Ceres Seq. ID 1573608 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:299: 
Met Leu Leu Val Leu Cys He Leu Pro Ala He Ala Met Ala Ala Arg 
50 1 5 10 15 

Arg Ser Asn He Gly Lys Asn Thr Met Val Val Gin Gly Ser Thr Tyr 

20 25 30 

Cys Asp He Cys Lys Phe Gly Phe Glu Thr Pro Glu Ser Ser Tyr Phe 
35 4 0 4 5 

55 He Pro Gly Ala Thr Val Lys Leu Ser Cys Lys Asp Arg Lys Thr Met 
50 55 60 

Glu Glu Val Tyr Thr Asp Lys Ala Val Ser Asp Lys Glu Gly Lys Tyr 
65 ~ 70 75 80 

Lys Phe He Val His Asp Asp His Arg Asp Gin Met Cys Asp Val Leu 
60 ' 85 90 95 

Leu Val Lys Ser Ser Asp Lys Thr Cys Ser Lys He Ser Val Gly Arg 
100 105 110 
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Glu Lys 



He 

Phe 
145 
Asp 

(2) 



Arg 
130 
Cys 



Ser 
115 
His 



Arg Val He Leu 



Ala Asn Asn 



Ser Ala Leu 



He Lys Asn 

INFORMATION 

(i) 



His 
165 
FOR 



Phe 
150 
Leu 



Met 
135 
Gin 



Asn 
120 
Gly 



468 
His Tyr 



Ser Gly 



Phe Glu Lys 



Lys Tyr Met 



Val 
155 



Glu 
140 
Asp 



He 
125 
Val 



Ala Ser Gin 



Ser Asp Val 



Glu Asp Glu 



Asp 
160 



SEQ ID NO:300: 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: c..mino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 154 

(D) OTHER INFORMATION: / Ceres 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



Seq. 
300: 



ID 1573609 



Met 


Ala 


Ala 


Arg 


Arg 


Ser 


Asn 


He 


Gly 


Lys 


Asn 


Thr 


Met 


Vul 


Val 


Gin 










5 










10 










15 




Gly 


Ser 


Thr 


Tyr 


Cys 


Asp 


He 


Cys 


Lys 


Phe 


Gly 


Phe 


Glu 


Thr 


Pro 


Glu 








20 










25 










30 






Ser 


Ser 


Tyr 


Phe 


He 


Pro 


Gly 


Ala 


Thr 


Val 


Lys 


Leu 


Ser 


Cys 


Lys 


Asp 






35 










40 










45 








Arg 


Lys 


Thr 


Met 


Glu 


Glu 


Val 


Tyr 


Thr 


Asp 


Lys 


Ala 


Val 


Ser 


Asp 


Lys 




50 










55 










60 










Glu 


Gly 


Lys 


Tyr 


Lys 


Phe 


He 


Val 


His 


Asp 


Asp 


His 


Arg 


Asp 


Gin 


Met 


65 










70 










75 










80 


Cys 


Asp 


Val 


Leu 


Leu 


Val 


Lys 


Ser 


Ser 


Asp 


Lys 


Thr 


Cys 


Ser 


Lys 


He 










85 










90 










95 




Ser 


Val 


Gly 


Arg 


Glu 


Lys 


Ser 


Arg 


Val 


He 


Leu 


Asn 


His 


Tyr 


Ser 


Gly 








100 










105 










110 






He 


Ala 


Ser 


Gin 


He 


Arg 


His 


Ala 


Asn 


Asn 


Met 


Gly 


Phe 


Glu 


Lys 


Glu 






115 










120 










125 








Val 


Ser 


Asp 


Val 


Phe 


Cys 


Ser 


Ala 


Leu 


Phe 


Gin 


Lys 


Tyr 


Met 


Val 


Asp 




130 










135 










140 










Glu 


Asp 


Glu 


Asp 


Asp 


He 


Lys 


Asn 


His 


Leu 














145 










150 






















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:301: 

















(i) 



(ii) 
(ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



linear 

DNA (genomic) 



(xi) 
aaaaaatcga 
taaagaatgg 
tccaatctca 
tctccattat 
gaaagctcag 
aagtctgaaa 
gcatcattga 
ctagttggta 
agacaggagc 
cttgtgtgtg 



(D) TOPOLOGY: 
MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 704 

(D) OTHER INFORMATION: / Ceres Seq. ID 1573861 
SEQUENCE DESCRIPTION: SEQ ID NO: 301: 



atcaaaatct 
cttcaatttc 
catcgattca 
ctcttactcg 
attcatcgac 
ataggttagg 
atctcatcac 
tcttgttctc 
tttcgaaaat 
tgataatact 



ctaaagtttc 
tgcaactttg 
aaaactccca 
aaaccctagc 
tgatctcgac 
tcttattggt 
ggcaattgac 
cacgtggttc 
tgtcaagaaa 
tcatctttgg 



aatttttttg 
ccttcgccat 
ttttctctaa 
agcatcagtc 
gttgttagta 
ttgggttttg 
aaattgcccg 
acatatcgat 
tcagtagcgg 
aagatgattt 



ttctgttctt 
tgttactcac 
ctcgaggtac 
tgatggtgaa 
cgattcagaa 
ctggtattgt 
ttatctcgag 
atctcttgtt 
atatacttgg 
gtttgcaagt 



tttttttttt 
acagagaaaa 
gaatgatctt 
agctagtgga 
tgtttgggat 
agctctttgg 
cggattcgaa 
caaaccggac 
cAcagtgaac 
ttgtaaaatt 



60 
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180 
240 
300 
360 
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acatgacagg gtggttgttg tttctagtcc aataatgtca tgcatttgaa acctgtaaat 660 
actttattgt tggtttttgg ttgtgagcaa aatcaatctt ttct 
(2) INFORMATION FOR SEQ ID NO: 302: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 179 

(D) OTHER INFORMATION: / Ceres Seq. ID 1573862 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 



15 


Met 
1 


Ala 


Ser 


He 


Ser 
5 


Ala 


Thr 


Leu 


Pro 


Ser 
10 


Pro 


Leu 


Leu 


Leu 


Thr 
15 


Gin 




Arg 


Lys 


Ser 


Asn 

20 


Leu 


Thr 


Ser 


He 


Gin 
25 


Lys 


Leu 


Pro 


Phe 


Ser 
30 


Leu 


Thr 




Arg 


Gly 


Thr 


Asn 


Asp 


Leu 


Ser 


Pro 


Leu 


Ser 


Leu 


Thr 


Arg 


Asn 


Pro 


Ser 


20 






35 










40 










45 ^ 










Ser 


He 
50 


Ser 


Leu 


Met 


Val 


Lys 
55 


Ala 


Ser 


Gly 


Glu 


Ser 
60 


Ser 


Asp 


Ser 


Ser 




Thr 


Asp 


Leu 


Asp 


Val 


Val 


Ser 


Thr 


He 


Gin 


Asn 


Val 


Trp 


Asp 


Lys 


Ser 




65 










70 










75 










80 


25 


Glu 


Asn 


Arg 


Leu 


Gly 
85 


Leu 


He 


Gly 


Leu 


Gly 
90 


Phe 


Ala 


Gly 


lie 


Val 
95 


Ala 




Leu 


Trp 


Ala 


Ser 
100 


Leu 


Asn 


Leu 


He 


Thr 
105 


Ala 


He 


Asp 


Lys 


Leu 
110 


Pro 


Val 




He 


Ser 


Ser 


Gly 


Phe 


Glu 


Leu 


Val 


Gly 


He 


Leu 


Phe 


Ser 


Thr 


Trp 


Phe 


30 






115 










120 










125 










Thr 


Tyr 
130 


Arg 


Tyr 


Leu 


Leu 


Phe 
135 


Lys 


Pro 


Asp 


Arg 


Gin 
140 


Glu 


Leu 


Ser 


Lys 




He 


Val 


Lys 


Lys 


Ser 


Val 


Ala 


Asp 


He 


Leu 


Gly 


Thr 


Val 


Asn 


Leu 


Val 




145 










150 










155 










160 


35 


Cys 


Val 


He 


lie 


Leu 


His 


Leu 


Trp 


Lys 


Met 


lie 


Cys 


Leu 


Gin 


Val 


Cys 



165 170 175 



Lys lie Thr 

(2) INFORMATION FOR SEQ ID MO: 303: 
4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 127 

(D) OTHER INFORMATION: / Ceres Seq. ID 1573863 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:303: 



Met 


Val 


Lys 


Ala 


Ser 


Gly 


Glu 


Ser 


Ser 


Asp 


Ser 


Ser 


Thr 


Asp 


Leu 


Asp 


1 








5 










10 










15 




Val 


Val 


Ser 


Thr 

20 


He 


Gin 


Asn 


Val 


Trp 

25 


Asp 


Lys 


Ser 


Glu 


Asn 
30 


Arg 


Leu 


Gly 


Leu 


He 
35 


Gly 


Leu 


Gly 


Phe 


Ala 
40 


Gly 


lie 


Val 


Ala 


Leu 
45 


Trp 


Ala 


Ser 


Leu 


Asn 
50 


Leu 


lie 


Thr 


Ala 


lie 

55 


Asp 


Lys 


Leu 


Pro 


Val 
60 


He 


Ser 


Ser 


Gly 


Phe 


Glu 


Leu 


Val 


Gly 


lie 


Leu 


Phe 


Ser 


Thr 


Trp 


Phe 


Thr 


Tyr 


Arg 


Tyr 


65 










70 










75 










80 


Leu 


Leu 


Phe 


Lys 


Pro 
85 


Asp 


Arg 


Gin 


Glu 


Leu 
90 


Ser 


Lys 


lie 


Val 


Lys 
95 


Lys 
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Ser Val Ala Asp lie Leu Gly Thr Val Asn Leu Val Cys Val lie lie 



100 105 
Leu His Leu Trp Lys Met lie Cys Leu Gin Val Cys Lys 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 304: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
DNA (genomic) 



110 

lie Thr 



(ii) MOLECULE TYPE: 
fix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



1. . 662 

(D) OTHER INFORMATION: / Ceres Seq. ID 1574093 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 
attttttctt tctctctctc ctccctaagc aaaactaaaa caagctatgg ctggtatgct 
tcccggagtt gagtgtgcaa ggaggcggcg cttccacggt ggtgctcctc cgattgaatc 
ctcgaacaca gcttctgtgg cggctgcggc gggacacgtc tggacacggc gaccatcgtt 
ctctctttac actaccaatc atgagagcca ccaggcccat gtctccttct cggagagaag 
tgttaggaat aaatcttatg gagaagacaa cgatgagaaa cttgacggag cagccaaaga 
ggcaaagcag aggcttaaca agcggctgag aatcccacca cgtacaagtt caggcaaaat 
ggtaaagaca aagggaataa attggagcaa ggaaagggta aacctctcgg ggacttaccg 
accgaggtgg Gtcgggttaa agaagagccg aggaaggttg atggaatggt tcaagcggcg 
agttagggaa caacaagatt gtgctatatg tctagaccgg ttcaagaagg gtgagacctt 
ggtacaccta ccatgtgccc ataagtttca ctccatatgc ttattgcctt ggctagacac 
taatgtttat tgcccatatt gtagaactga tatttggaat taaatgttat atttttgatg 
tc 

(2) INFORMATION FOR SEQ ID NO: 305: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



peptide 
1. .198 

(D) OTHER INFORMATION: / Ceres Seq. ID 1574094 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 
Met Ala Gly Met Leu Pro Gly Val Glu Cys Ala Arg Arg Arg Arg Phe 
1 5 10 15 

His Gly Gly Ala Pro Pro lie Glu Ser Ser Asn Thr Ala Ser Val Ala 

20 25 30 

Ala Ala Ala Gly His Val Trp Thr Arg Arg Pro Ser Phe Ser Leu Tyr 

35 40 45 

Thr Thr Asn His Glu Ser His Gin Ala His Val Ser Phe Ser Glu Arg 

50 55 60 

Ser Val Arg Asn Lys Ser Tyr Gly Glu Asp Asn Asp Glu Lys Leu Asp 
65 70 75 80 

Gly Ala Ala Lys Glu Ala Lys Gin Arg Leu Asn Lys Arg Leu Arg lie 

85 90 95 

Pro Pro Arg Thr Ser Ser Gly Lys Met Val Lys Thr Lys Gly lie Asn 

100 105 110 

Trp Ser Lys Glu Arg Val Asn Leu Ser Gly Thr Tyr Arg Pro Arg Trp 

115 120 125 

Val Gly Leu Lys Lys Ser Arg Gly Arg Leu Met Glu Trp Phe Lys Arg 

130 135 140 

Arg Val Arg Glu Gin Gin Asp Cys Ala lie Cys Leu Asp Arg Phe Lys 
145 150 ' 155 160 

Lys Gly Glu Thr Leu Val His Leu Pro Cys Ala His Lys Phe His Ser 
165 17 0 175 



60 
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lie Cys Leu Leu Pro Trp Leu Asp Thr Asn 
180 185 
Thr Asp lie Trp Asn 
195 

INFORMATION FOR SEQ ID NO: 306: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 



Val Tyr Cys 



Pro 
190 



Tyr Cys 



Arg 

(2) 



(ii) 
(ix) 



(xi) 



(D) TOPOLOGY: 
MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



linear 
peptide 



peptide 
1 . . 195 

(D) OTHER INFORMATION: / Ceres 
SEQUENCE DESCRIPTION: SEQ ID NO: 



Seq. 
306: 



ID 1574095 



Met 


Leu 


Pro 


Gly 


Val 


Glu 


Cys 


Ala 


Arg 


Arg 


Arg 


Arg 


Phe 


His 


Gly 


Gly 


1 








5 










1(T 










15 




Ala 


Pro 


Pro 


He 


Glu 


Ser 


Ser 


Asn 


Thr 


Ala 


Ser 


Val 


Ala 


Ala 


Ala 


Ala 








20 










25 










30 






Gly 


His 


Val 


Trp 


Thr 


Arg 


Arg 


Pro 


Ser 


Phe 


Ser 


Leu 


Tyr 


Thr 


Thr 


Asn 






35 










40 










45 








His 


Glu 


Ser 


His 


Gin 


Ala 


His 


Val 


Ser 


Phe 


Ser 


Glu 


Arg 


Ser 


Val 


Arg 




50 










55 










60 








Asn 


Lys 


Ser 


Tyr 


Gly 


Glu 


Asp 


Asn 


Asp 


Glu 


Lys 


Leu 


Asp 


Gly 


Ala 


Ala 


65 










70 










75 










80 


Lys 


Glu 


Ala 


Lys 


Gin 


Arg 


Leu 


Asn 


Lys 


Arg 


Leu 


Arg 


He 


Pro 


Pro 


Arg 










85 










90 










95 




Thr 


Ser 


Ser 


Gly 


Lys 


Met 


Val 


Lys 


Thr 


Lys 


Gly 


He 


Asn 


Trp 


Ser 


Lys 








100 










105 










110 






Glu 


Arg 


Val 


Asn 


Leu 


Ser 


Gly 


Thr 


Tyr 


Arg 


Pro 


Arg 


Trp 


Val 


Gly 


Leu 






115 










120 










125 








Lys 


Lys 


Ser 


Arg 


Gly 


Arg 


Leu 


Met 


Glu 


Trp 


Phe 


Lys 


Arg 


Arg 


Val 


Arg 




130 










135 










140 










Glu 


Gin 


Gin 


Asp 


Cys 


Ala 


He 


Cys 


Leu 


Asp 


Arg 


Phe 


Lys 


Lys 


Gly 


Glu 


145 










150 










155 










160 


Thr 


Leu 


Val 


His 


Leu 


Pro 


Cys 


Ala 


His 


Lys 


Phe 


His 


Ser 


He 


Cys 


Leu 










165 










170 










175 




Leu 


Pro 


Trp 


Leu 


Asp 


Thr 


Asn 


Val 


Tyr 


Cys 


Pro 


Tyr 


Cys 


Arg 


Thr 


Asp 








180 










18 5 










190 






lie 


Trp 


Asn 
































195 




























(2) 


INFORMATION 


FOR 


SEQ 


id : 


NO: 307 : 

















(i) 



(ii) 
(ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .725 

(D) OTHER INFORMATION: / Ceres 
SEQUENCE DESCRIPTION: SEQ ID NO: 



Seq. 
307 : 



ID 1580388 



(xi) 

ctcacacaaa aatacaacaa cttagatcag tctcaaaggg ggaaaaaaaa cttaaaagaa 60 

acattaagag gcaacacaaa tcacacaaaa gatcaaattg aagcctaaga agaaggcaaa 120 

aagtgagaag caatggctac catgttgaag gtctctcttg tattgtcatt gttaggtttt 180 

ctggtgatcg ctgtcgtgac tccatcggcg gcgaacccat tcaggaagag cgtagttctc 240 

ggagggaagt caggcgttcc aaacattcgg accaacaggg aaattcaaca acttggaagg 300 

tactgcgtgg agcaattcaa tcaacaagca cagaacgagc aaggaaacat aggatccatt 360 

gcgaaaacag acacggcaat ttcgaatcca ttgcaattta gccgagtagt gtctgctcag 420 

aaacaggtcg tcgctggact caaatactat ctaaggattg aagtcactca acccaatggc 480 
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tctaccagga tgtttgactc tgttgtggtt attcaaccat ggctccattc taagcagttg 540 
ctcggtttca ctcctgttgt cagtcctgtc tactaacttt atttcttctt attcgactta 600 
aatttccata atatgatcaa gaaaagacta aaaggtgtat gatacaaagc tattaagaat 660 
gggttaatag ttggttttca tgatatgttt acgttgttca taaataaaaa caagttgtta 720 
ttagg 

(2) INFORMATION FOR SEQ ID NO: 308: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..147 

(D) OTHER INFORMATION: / Ceres Seq. ID 1580389 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 



Met 


Ala 


Thr 


Met 


Leu 


Lys 


Val 


Ser 


Leu 


Val 


Leu 


Ser 


Leu 


Leu 


Gly 


Phe 


1 








5 










10 










15 . 




Leu 


Val 


He 


Ala 

20 


Val 


Val 


Thr 


Pro 


Ser 
25 


Ala 


Ala 


Asn 


Pro 


Phe 
30 


Arg 


Lys 


Ser 


Val 


Val 
35 


Leu 


Gly 


Gly 


Lys 


Ser 
40 


Gly 


Val 


Pro 


Asn 


He 
45 


Arg 


Thr 


Asn 


Arg 


Glu 
50 


He 


Gin 


Gin 


Leu 


Gly 
55 


Arg 


Tyr 


Cys 


Val 


Glu 
60 


Gin 


Phe 


Asn 


Gin 


Gin 


Ala 


Gin 


Asn 


Glu 


Gin 


Gly 


Asn 


He 


Gly 


Ser 


He 


Ala 


Lys 


Thr 


Asp 


65 










70 










75 










80 


Thr 


Ala 


He 


Ser 


Asn 
85 


Pro 


Leu 


Gin 


Phe 


Ser 
90 


Arg 


Val 


Val 


Ser 


Ala 
95 


Gin 


Lys 


Gin 


Val 


Val 
100 


Ala 


Gly 


Leu 


Lys 


Tyr 
105 


Tyr 


Leu 


Arg 


He 


Glu 
110 


Val 


Thr 


Gin 


Pro 


Asn 
115 


Gly 


Ser 


Thr 


Arg 


Met 
120 


Phe 


Asp 


Ser 


Val 


Val 
125 


Val 


He 


Gin 


Pro 


Trp 
130 


Leu 


His 


Ser 


Lys 


Gin 
135 


Leu 


Leu 


Gly 


Phe 


Thr 
140 


Pro 


Val 


Val 


Ser 



Pro Val Tyr 
145 

(2) INFORMATION FOR SEQ ID NO: 309: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..144 

(D) OTHER INFORMATION: / Ceres Seq. ID 1580390 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 



Met 


Leu 


Lys 


Val 


Ser 


Leu 


Val 


Leu 


Ser 


Leu 


Leu 


Gly 


Phe 


Leu 


Val 


He 


1 








5 










10 










15 




Ala 


Val 


Val 


Thr 


Pro 


Ser 


Ala 


Ala 


Asn 


Pro 


Phe 


Arg 


Lys 


Ser 


Val 


Val 








20 










25 










30 






Leu 


Gly 


Gly 


Lys 


Ser 


Gly 


Val 


Pro 


Asn 


He 


Arg 


Thr 


Asn 


Arg 


Glu 


He 






35 










40 










45 








Gin 


Gin 


Leu 


Gly 


Arg 


Tyr 


Cys 


Val 


Glu 


Gin 


Phe 


Asn 


Gin 


Gin 


Ala 


Gin 




50 










55 










60 










Asn 


Glu 


Gin 


Gly 


Asn 


He 


Gly 


Ser 


He 


Ala 


Lys 


Thr 


Asp 


Thr 


Ala 


He 


65 










70 










75 










80 


Ser 


Asn 


Pro 


Leu 


Gin 


Phe 


Ser 


Arg 


Val 


Val 


Ser 


Ala 


Gin 


Lys 


Gin 


Val 










85 










90 










95 




Val 


Ala 


Gly 


Leu 


Lys 


Tyr 


Tyr 


Leu 


Arg 


He 


Glu 


Val 


Thr 


Gin 


Pro 


Asn 
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100 



105 



110 



Gly Ser Thr Arg Met Phe Asp Ser Val Val Val lie Gin Pro Trp Leu 



115 



120 



125 



His Ser Lys Gin Leu Leu Gly Phe Thr Pro Val Val Ser Pro Val Tyr 



(2) 



130 135 
INFORMATION FOR SEQ ID NO: 310: 



140 



(i) 



(ii) 
(ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



DNA ( genomic ) 



1 . . 675 

(D) OTHER INFORMATION: / Ceres Seq. ID 1582959 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
tgggcaaggc aacaaacata atcaacttaa atcttatcta ctttctattt ctt tttaatc 
aaaattaccg ttcttaacta tggcgaagtg gtttttcact atcttcttgg tttttgccct 
agcctcagct ttagcttgtg gcgcaagaaa cgtcccagta ggcctctctg accaaaagaa 
ctacctcgga tatggtggcg gatattccgg cgttggagac aatggtttac cctttggtgg 
cgtcggtgga ggtgtgtctg gtcccggagg taatcttggt tatgggggat ttggtggtgc 
tggtggcggc ttaggcggtg gtttgggcgg tggagcaggc agtggattag gcggtggctt 
aggtggtgga agtggaattg gtgccggaac cagtggagga agtaccggag gagttcattt 
cccttgagtt gttactttgg tttttaaggc gtcatacggt ccttattaag ctaggtctag 
cttaagatga tgtcataata ataatttatc atatctcttt agggttttaa actttggtat 
tatgaattat cattagctgt ttaacgtgcg tcttaagtta ctattttaac gtatgtttga 
atcagtctag tggcttgtcg tgtcatggct tggtccattt tcaaattcta ctttgacctt 
ttcgagtgtt tcacc 

(2) INFORMATION FOR SEQ ID NO: 311: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. .115 

(D) OTHER INFORMATION: / Ceres Seq. ID 1582960 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
Met Ala Lys Trp Phe Phe Thr lie Phe Leu Val Phe Ala Leu Ala Ser 
1 " 5 10 15 

Ala Leu Ala Cys Gly Ala Arg Asn Val Pro Val Gly Leu Ser Asp Gin 

20 25 30 

Lys Asn Tyr Leu Gly Tyr Gly Gly Gly Tyr Ser Gly Val Gly Asp Asn 

35 " 4 0 4 5 

Gly Leu Pro Phe Gly Gly Val Gly Gly Gly Val Ser Gly Pro Gly Gly 

50 55 60 

Asn Leu Gly Tyr Gly Gly Phe Gly Gly Ala Gly Gly Gly Leu Gly Gly 
65 70 75 80 

Gly Leu Gly Gly Gly Ala Gly Ser Gly Leu Gly Gly Gly Leu Gly Gly 

85 90 95 

Gly Ser Gly lie Gly Ala Gly Thr Ser Gly Gly Ser Thr Gly Gly Val 
100 105 110 

His Phe Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 312: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



60 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
5 (B) LOCATION: 1..1076 

(D) OTHER INFORMATION: / Ceres Seq. ID 1663221 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



aacacgagct taccgattct cttcgtctca acatttaggg tttcagagat cggctgattt 60 

tcacatccac aagtatggcg actcaaatca gcaagaagag aaagtttgta gcggatggtg 120 

10 tattctacgc tgaattgaat gaggttctca caagagagct agcagaggat ggttactctg 18 0 

gtgttgaggt tagggttact ccaatgagga ctgagattat catcagagct actcgtactc 24 0 

agaatgttct tggtgagaag gggaggagaa ttagggaatt gacatctctt gtccagaaga 300 

gattcaagtt tccagttgac agtgttgagc tctatgctga gaaggttaac aacagaggtc 360 

tctgtgccat tgctcaggct gagtctctac gttacaagct tctcggtggt cttgctgttc 420 

15 gcagggcctg ttatggtgtt ttgagatttg ttatggagag tggagctaag ggatgtgagg 480 

tcatcgtgag tggaaagctc cgtgctgcac gtgctaagtc catgaagttc aaggatggtt 540 

acatggtttc atctggtcaa ccaactaagg aatacatcga tgctgcagtg agacatgttt 600 

tgctcagaca gggggtgttg ggaatcaagg tgaagatcat gcttgactgg gaccctacgg 660 

gcaaatcagg accaaagaca ccattgcctg atgttgtgat cattcatgct cctaaagatg 720 

20 atgttgtcta ctctgcacct gctcaggctg ctgctccagt tactcttgtg caagaagctc 780 

cactcacaac cgtagattac cctgagatga ttcctccagt ggcctagaga agaccttttt 840 

tactattact ccatgggatt ttgttctttt tgttataact tttctacttt tgacactctc 900 

tagtcgtatc tcttatatcc tccaagcaga caagtttttt ctcatgttgt ttcttaacct 960 

atgtcaagtt ttctggaggt tgaatgcttt tagttccatg atttttttgt tcttaagact 1020 



25 cgtaataaga aaaagaatga cgacaacaat gcctggtatt tgatgttgtt actttg 
(2) INFORMATION FOR SEQ ID NO: 313: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
30 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
35 (B) LOCATION: 1..250 

(D) OTHER INFORMATION: / Ceres Seq. ID 1663222 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
Met Ala Thr Gin lie Ser Lys Lys Arg Lys Phe Val Ala Asp Gly Val 
15 10 15 

4 0 Phe Tyr Ala Glu Leu Asn Glu Val Leu Thr Arg Glu Leu Ala Glu Asp 

2 0 25 30 

Gly Tyr Ser Gly Val Glu Val Arg Val Thr Pro Met Arg Thr Glu lie 

35 40 45 

lie lie Arg Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg 
45 50 55 60 

Arg lie Arg Glu Leu Thr Ser Leu Val Gin Lys Arg Phe Lys Phe Pro 
65 70 75 80 

Val Asp Ser Val Glu Leu Tyr Ala Glu Lys Val Asn Asn Arg Gly Leu 
8 5 9 0 95 

50 Cys Ala lie Ala Gin Ala Glu Ser Leu Arg Tyr Lys Leu Leu Gly Gly 

100 105 110 

Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu Arg Phe Val Met Glu 

115 120 125 

Ser Gly Ala Lys Gly Cys Glu Val lie Val Ser Gly Lys Leu Arg Ala 
55 130 " " 135 140 

Ala Arg Ala Lys Ser Met Lys Phe Lys Asp Gly Tyr Met Val Ser Ser 
145 ' 150 155 160 

Gly Gin Pro Thr Lys Glu Tyr He Asp Ala Ala Val Arg His Val Leu 
165 170 175 

60 Leu Arg Gin Gly Val Leu Gly He Lys Val Lys He Met Leu Asp Trp 

180 185 190 

Asp Pro Thr Gly Lys Ser Gly Pro Lys Thr Pro Leu Pro Asp Val Val 
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195 200 205 

lie lie His Ala Pro Lys Asp Asp Val Val Tyr Ser Ala Pro Ala Gin 

210 215 220 

Ala Ala Ala Pro Val Thr Leu Val Gin Glu Ala Pro Leu Thr Thr Val 
5 225 230 235 240 

Asp Tyr Pro Glu Met lie Pro Pro Val Ala 
245 250 
(2) INFORMATION FOR SEQ ID NO: 314: 
(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 207 amino acids 

■(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

15 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 207 

(D) OTHER INFORMATION: / Ceres Seq. ID 1663223 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
2 0 Met Arg Thr Glu lie lie lie Arg Ala Thr Arg Thr Gin Asn Val Leu 
15 10 15 

Gly Glu Lys Gly Arg Arg lie Arg Glu Leu Thr Ser Leu Val Gin Lys 

20 2 5 30 

Arg Phe Lys Phe Pro Val Asp Ser Val Glu Leu Tyr Ala Glu Lys Val 
2 5 35 4 0 4 5 

Asn Asn Arg Gly Leu Cys Ala lie Ala Gin Ala Glu Ser Leu Arg Tyr 

50 55 60 

Lys Leu Leu Gly Gly Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu 
65 70 75 80 

30 Arg Phe Val Met Glu Ser Gly Ala Lys Gly Cys Glu Val lie Val Ser 

85 90 95 

Gly Lys Leu Arg Ala Ala Arg Ala Lys Ser Met Lys Phe Lys Asp Gly 

100 105 110 

Tyr Met Val Ser Ser Gly Gin Pro Thr Lys Glu Tyr lie Asp Ala Ala 
35 115 120 125 

Val Arg His Val Leu Leu Arg Gin Gly Val Leu Gly lie Lys Val Lys 

130 135 140 

lie Met Leu Asp Trp Asp Pro Thr Gly Lys Ser Gly Pro Lys Thr Pro 
145 150 155 160 

4 0 Leu Pro Asp Val Val lie lie His Ala Pro Lys Asp Asp Val Val Tyr 

165 170 ^ 175 

Ser Ala Pro Ala Gin Ala Ala Ala Pro Val Thr Leu Val Gin Glu Ala 

180 185 190 

Pro Leu Thr Thr Val Asp Tyr Pro Glu Met lie Pro Pro Val Ala 
45 195 200 205 

(2) INFORMATION FOR SEQ ID NO: 315: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/ KEY : - 
55 (B) LOCATION: 1..488 

(D) OTHER INFORMATION: / Ceres Seq. ID 1663275 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
gtgactaaat ttctagggtt ttgaagagtc tctctcgtcg ccgtttgtta ttcctctgtg 60 
agcaaatcga cgaaatgggt cactctaatg tatggaactc tcatccgaag aagtacggtc 120 
60 ctggatctcg tttatgccgt gtgtgcggga actcgcacgg tctgatccgg aagtatggtt 180 
tgaactgctg cagacagtgt ttccgtagca acgctaagga gattggattc attaagtacc 24 0 

gttaatcaag caccaacttc atgattgatg cttaatgata taaacatgaa ggcgtcgatg 300 
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ggtatggctt ttaagctttt gtagtttttg aaatttttac ttttgagaac cattgttatt 360 

ttgggagtta attaagttgt tgaacctcat taagcatgtc ttattttgga ttaatgatgt 420 

tttggctatt ctcgtatttt tgttttatca gtcaaatttg actaagaaga atttgcttgt 480 



tattagcc 

(2) INFORMATION FOR SEQ ID NO: 316: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..5 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1663276 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:316: 
Met Gly His Ser Asn Val Trp Asn Ser His Pro Lys Lys Tyr Gly Pro 
1^5 10 15 

Gly Ser Arg Leu Cys Arg Val Cys Gly Asn Ser His Gly Leu lie Arg 

20 25 30 

Lys Tyr Gly Leu Asn Cys Cys Arg Gin Cys Phe Arg Ser Asn Ala Lys 

35 40 * 45 

Glu lie Gly Phe lie Lys Tyr Arg 

50 55 
(2) INFORMATION FOR SEQ ID NO: 317: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

( A ) NAME / KEY : pep t i de 

(B) LOCATION: 1 . . 38 

(D) OTHER INFORMATION: / Ceres Seq. ID 1663277 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
Met Glu Leu Ser Ser Glu Glu Val Arg Ser Trp lie Ser Phe Met Pro 
15 10 15 

Cys Val Arg Glu Leu Ala Arg Ser Asp Pro Glu Val Trp Phe Glu Leu 

20 25 30 

Leu Gin Thr Val Phe Pro 
35 

(2) INFORMATION FOR SEQ ID NO: 318: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .759 

(D) OTHER INFORMATION: / Ceres Seq. ID 1665304 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



atcaaccaaa acaaaacata aaaaacaagt ggaagcttta aaacgagagg gagagagcaa 60 

aaatggcgac gtcgggaacg tacgtgacgg aagttccgct aaaaggatcg gccgagaaat 120 

actacaagag gtggaagaac gagaaccatg tcttccctga tgctatcggc caccacatcc 180 

aaaatgttac cgttcacgaa ggcgaacatg actctcacgg gtctatcagg agttggaact 240 

acacatggga tggaaaggag gaggtgttca aggagagaag agagatagac gatgagacca 300 

aaacgttgac gttaagagga cttgagggtc acgtgatgga gcagctcaaa gtgtacgacg 360 

tcgtctacca attcattccc aaatctgagg atacctgcat cggcaaaatc actttaatat 420 

gggagaagcg caacgatgat tccccagaac caagcggcta catgaaattc gtcaagagct 480 
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tggttgctga catgggaaac cacgttagca aaacttaatc atcattccca cagtcgtcgt 540 

cgtcgtcatc atcatcatca tcatcatcat catcatcatc atcatcatca tcatcatcat 600 

catcactatc tcgatttata agttaagatg ttttcagtat aataaatggg gtcttgtgga 660 

tcgttcattt ctatgtgtaa accgtttggt tctgtatgat gcttcgatat attgttatgt 720 



tcatgatcat atgtcgggtt cgatataatg attcttaag 
(2) INFORMATION FOR SEQ ID NO: 319: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS : 
■(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 151 

(D) OTHER INFORMATION: / Ceres Seq. ID 1665305 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 



Met 


Ala 


Thr 


Ser 


Gly 


Thr 


Tyr 


Val 


Thr 


Glu 


Val 


Pro 


Leu 


Lys 


Gly 


Ser 


1 








5 










10 










15 




Ala 


Glu 


Lys 


Tyr 


Tyr 


Lys 


Arg 


Trp 


Lys 


Asn 


Glu 


Asn 


His 


Val 


Phe 


Pro 








20 










25 










30 






Asp 


Ala 


He 


Gly 


His 


His 


He 


Gin 


Asn 


Val 


Thr 


Val 


His 


Glu 


Gly 


Glu 






35 










40 










45 








His 


Asp 


Ser 


His 


Gly 


Ser 


He 


Arg 


Ser 


Trp 


Asn 


Tyr 


Thr 


Trp 


Asp 


Gly 




50 










55 










60 










Lys 


Glu 


Glu 


Val 


Phe 


Lys 


Glu 


A.rg 


Arg 


Glu 


He 


Asp 


Asp 


Glu 


Thr 


Lys 


65 










70 










75 










80 


Thr 


Leu 


Thr 


Leu 


Arg 


Gly 


Leu 


Glu 


Gly 


His 


Val 


Met 


Glu 


Gin 


Leu 


Lys 










85 










90 










95 




Val 


Tyr 


Asp 


Val 


Val 


Tyr 


Gin 


Phe 


He 


Pro 


Lys 


Ser 


Glu 


Asp 


Thr 


Cys 








100 










105 










110 






He 


Gly 


Lys 


He 


Thr 


Leu 


He 


Trp 


Glu 


Lys 


Arg 


Asn 


Asp 


Asp 


Ser 


Pro 






115 










120 










125 








Glu 


Pro 


Ser 


Gly 


Tyr 


Met 


Lys 


Phe 


Val 


Lys 


Ser 


Leu 


Val 


Ala 


Asp 


Met 




130 










135 










140 










Gly 


Asn 


His 


Val 


Ser 


Lys 


Thr 





















145 150 
(2) INFORMATION FOR SEQ ID NO: 320: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 602 

(D) OTHER INFORMATION: / Ceres Seq. ID 1709970 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



gaagtctctt atatataagt atcgagagaa ttagggtttt ctagtctagt gtttccggga 60 

gcagcctgag caggaggaag aagacgaaga agaagaggag cagctactac catggttctg 120 

ggacggagct ttgtcgtttc agtggacaga agatttaccc aggaagagga attaggttta 180 

tccgatctga ttctcaggtt ttcttgtttc ttaactcaaa atgtaagagg tacttccata 240 

acaagttgaa gccatccaag cttgcatgga ctgccatgta cagaaagcaa cncaagaagg 300 

atgcagcaca agaggctgtg aagagaagga gacgtgccac caagaagcca tactcaaggt 360 

ccattgttgg tgctaccttg gaagtaattc agaagaagag agctgagaag cctgaagttc 420 

gtgatgcagc cagggaagct gctctgcgtg agatcaagga aagaatcaaa aagaccaaag 480 

atgaaaagaa ggctaagaag gtggaatttg cttctaagca acagaaggtc aaggctaatt 540 

tccccaaagc tgctgctgca tccaagggtc ctaaggtggg aggtggtggt ggcaaacgct 600 
ga 

(2) INFORMATION FOR SEQ ID NO: 321: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. . 108 

(D) OTHER INFORMATION: / Ceres Seq. ID 1709971 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:321 












Met 


Tyr 


Arg 


Lys 


Gin 


Xaa 


Lys 


Lys 


Asp 


Ala 


Ala 


Gin 


Glu 


Ala 


Val 


Lys 


1 








5 










10 










15 




Arg 


Arg 


Arg 


Arg 
20 


Ala 


Thr 


Lys 


Lys 


Pro 
25 


Tyr 


Ser 


Arg 


Ser 


He 
30 


Val 


Gly 


Ala 


Thr 


Leu 
35 


Glu 


Val 


He 


Gin 


Lys 
40 


Lys 


Arg 


Ala 


Glu 


Lys 
45 


Pro 


Glu 


Val 


Arg 


Asp 
50 


Ala 


Ala 


Arg 


Glu 


Ala 
55 


Ala 


Leu 


Arg 


Glu 


He 
60 


Lys 


Glu 


Arg 


He 


Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Val 


Glu 


Phe 


Ala 


Ser 


65 










70 










75 










80 


Lys 


Gin 


Gin 


Lys 


Val 
85 


Lys 


Ala 


Asn 


Phe 


Pro 
90 


Lys 


Ala 


Ala 


Ala 


Ala 
95 


Ser 


Lys 


Gly 


Pro 


Lys 
100 


Val 


Gly 


Gly 


Gly 


Gly 
105 


Gly 


Lys 


Arg 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO:322: 

















fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 59 

(D) OTHER INFORMATION: / Ceres Seq. ID 1709972 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
Met Gin Pro Gly Lys Leu Leu Cys Val Arg Ser Arg Lys Glu Ser Lys 
1 5 10 15 

Arg Pro Lys Met Lys Arg Arg Leu Arg Arg Trp Asn Leu Leu Leu Ser 

20 25 30 

Asn Arg Arg Ser Arg Leu He Ser Pro Lys Leu Leu Leu His Pro Arg 

35 " 40 45 

Val Leu Arg Trp Glu Val Val Val Ala Asn Ala 

50 55 
(2) INFORMATION FOR SEQ ID NO: 323: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : ~ 

(B) LOCATION: 1..726 

(D) OTHER INFORMATION: / Ceres Seq. ID 1711273 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 
atcttcttgt ctctctctca tttcttgtta ttatccacga acgaagaaaa acctagaaaa 60 
cagttgaaga aagaaaatca caagagaagc catggccgga attggaccga ttactcagga 120 
ttgggaacca gttgtgatcc gcaagagagc tcctaacgct gcagctaagc gcgacgagaa 180 
gacrgtcaac gccgctcgtc gaagcggcgc cgatattgag accgttcgaa aattcaatgc 240 
tggatcgaac aaggctgcat caagcggcac ctccttgaac acaaagaagc tagatgatga 300 
tactgagaac ttatctcatg atcgtgtgcc cactgaattg aagaaagcca tcatgcaagc 360 
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tagaggggag aasaagctga ctcagtccca acttgcccat ctgatcaatg agaagccaca 420 

agtgatccaa gaatacgagt cggggaaagc aattccgaat caacagatcc tttcaaagct 480 

ggagagggca cttggtgcta aactccgtgg aaagaagtag aagtttagaa caaagctctt 540 

aaaggtaaca aaaagctgat cgcagtttct ctccagtcca catgctttac catatcctaa 600 

aaactatatc tatgtatggt ttggtttaat ggcgtagtag tttgttgcga ggaatctttc 660 

atgatgtaag aaaaacaaag ctgtttggaa ccttttgtca ttataaataa tctcttctct 720 



ttcttt 

(2) INFORMATION FOR SEQ ID NO: 324: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 
"(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..142 

(D) OTHER INFORMATION: / Ceres Seq. ID 1711274 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 



Met 


Ala 


Gly 


He 


Gly 


Pro 


He 


Thr 


Gin 


Asp 


Trp 


Glu 


Pro 


Val 


Val 


He 


1 








5 










10 










15 




Arg 


Lys 


Arg 


Ala 


Pro 


Asn 


Ala 


Ala 


Ala 


Lys 


Arg 


Asp 


Glu 


Lys 


Thr 


Val 








20 










25 










30 






Asn 


Ala 


Ala 


Arg 


Arg 


Ser 


Gly 


Ala 


Asp 


He 


Glu 


Thr 


Val 


Arg 


Lys 


Phe 






35 










40 










45 








Asn 


Ala 


Gly 


Ser 


Asn 


Lys 


Ala 


Ala 


Ser 


Ser 


Gly 


Thr 


Ser 


Leu 


Asn 


Thr 




50 










55 










60 










Lys 


Lys 


Leu 


Asp 


Asp 


Asp 


Thr 


Glu 


Asn 


Leu 


Ser 


His 


Asp 


Arg 


Val 


Pro 


65 










70 










75 










80 


Thr 


Glu 


Leu 


Lys 


Lys 


Ala 


He 


Met 


Gin 


Ala 


Arg 


Gly 


Glu 


Xaa 


Lys 


Leu 










85 










90 










95 




Thr 


Gin 


Ser 


Gin 


Leu 


Ala 


His 


Leu 


He 


Asn 


Glu 


Lys 


Pro 


Gin 


Val 


He 








100 










105 










110 






Gin 


Glu 


Tyr 


Glu 


Ser 


Gly 


Lys 


Ala 


He 


Pro 


Asn 


Gin 


Gin 


He 


Leu 


Ser 






115 










120 










125 








Lys 


Leu 


Glu 


Arg 


Ala 


Leu 


Gly 


Ala 


Lys 


Leu 


Arg 


Gly 


Lys 


Lys 








130 










135 










140 










(2) 


INFORMATION 


FOR 


SEQ 


ID 1 


MO:325: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 68 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715423 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325: 



aacaataaaa caaagtgatt tattatggcg ttaagagaga tgataataga tgagagtgat 60 

gacatagaga gctacagtga tcaatctctt tgtcttgata aagccaagga gcttcttgct 120 

ctcatcaaac tacccacggg attgttgcca ttgaaggata tgacggaagt tgggtacaac 180 

aaaacaaaag ggtttgtgtg gatgaggctg agaagcaaga ttgagcatac attccgtgag 240 

ataggtcgca gagttttata tgacacggYa gataactgcg tttgttgagg accgccgtat 300 

gaggagactt accggagtca aaagcaaaga gctcatgatc tgggttcctg tgaatgatat 360 

cttcatcaaa gagaaagatc ctgagaagat cacctttgct aataccaccg gcctgtcacg 420 

aacatttaaa gtttcagcat ttcaatgtga aggttgataa ataaaaagaa aagaggagta 480 

accagacttc cttgatgaaa gcaaccaata aagaacagaa gttgttccat actcaataag 540 

agatagttta attaaatatc aaggacacgc ctttcttaca atgaattgct tatggatcat 600 

cacagtaaat aaacaggttc tgcaattgat gccttgatgt ataacagtga gtttccaatc 660 
aacatattag aatcgattca ttggtt 
(2) INFORMATION FOR SEQ ID NO: 32 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 87 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715424 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 



Met 


Ala 


Leu 


Arg 


Glu 


Met 


He 


He Asp 


Glu 


Ser 


Asp 


Asp 


He 


Glu 


Ser 


1 








5 








10 










15 




Tyr 


Ser 


Asp 


Gin 


Ser 


Leu 


Cys 


Leu Asp 


Lys 


Ala 


Lys 


Glu 


Leu 


Leu 


Ala 








20 








25 










30 






Leu 


He 


Lys 


Leu 


Pro 


Thr 


Gly 


Leu Leu 


Pro 


Leu 


Lys 


Asp 


Met 


Thr 


Glu 






35 










40 








45 








Val 


Gly 


Tyr 


Asn 


Lys 


Thr 


Lys 


Gly Phe 


Val 


Trp 


Met 


Arg 


Leu 


Arg 


Ser 




50 










55 








60 










Lys 


He 


Glu 


His 


Thr 


Phe 


Arg 


Glu He 


Gly 


Arg 


Arg 


Val 


Leu 


Tyr 


Asp 


65 










70 








75 










80 


Thr 


Xaa 


Asp 


Asn 


Cys 
85 


Val 


Cys 


















(2) 


INFORMATION 


FOR 


SEQ 


ID : 


MO: 327 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 82 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715425 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 



Met 


He 


He 


Asp 


Glu 


Ser 


Asp 


Asp 


He 


Glu 


Ser 


Tyr 


Ser 


Asp 


Gin 


Ser 


1 








5 










10 










15 




Leu 


Cys 


Leu 


Asp 


Lys 


Ala 


Lys 


Glu 


Leu 


Leu 


Ala 


Leu 


He 


Lys 


Leu 


Pro 








20 










25 










30 






Thr 


Gly 


Leu 


Leu 


Pro 


Leu 


Lys 


Asp 


Met 


Thr 


Glu 


Val 


Gly 


Tyr 


Asn 


Lys 






35 










40 










45 








Thr 


Lys 


Gly 


Phe 


Val 


Trp 


Met 


Arg 


Leu 


Arg 


Ser 


Lys 


He 


Glu 


His 


Thr 




50 










55 










60 










Phe 


Arg 


Glu 


He 


Gly 


Arg 


Arg 


Val 


Leu 


Tyr 


Asp 


Thr 


Xaa 


Asp 


Asn 


Cys 


65 










70 










75 










80 



Val Cys 

(2) INFORMATION FOR SEQ ID NO: 32 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715426 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
Met Thr Arg Xaa lie Thr Ala Phe Val Glu Asp Arg Arg Met Arg Arg 
1 ~ 5 10 15 

Leu Thr Gly Val Lys Ser Lys Glu Leu Met He Trp Val Pro Val Asn 
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20 25 30 

Asp lie Phe lie Lys Glu Lys Asp Pro Glu Lys lie Thr Phe Ala Asn 

35 " ^40 4 5 

Thr Thr Gly Leu Ser Arg Thr Phe Lys Val Ser Ala Phe Gin Cys Glu 
5 50 55 60 

Gly 
65 

(2) INFORMATION FOR SEQ ID NO: 329: 
(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 829 base pairs 

■(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 829 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715962 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:329: 

20 aaaaacgccg tcgagaaatc tccctctagg gtttctttga tcaaaccaga gagccgtcag 60 
ggagagcaaa acaaagatcg gagaagatga agacgattct ttcttccgaa acgatggaca 120 
tccccgacag tgttaccatc aaggttcacg ctaaagtgat cgaagtcgaa ggacctcgcg 180 
ggaagcttgt tcgcgatttc aagcatctca acctcgattt ccagctgatc aaggatccag 240 
agactggaaa gaagaagctt aagatcgatt cgtggtttgg aacacgcaaa accagcgcct 300 

25 ccatcagaac cgctcttagc cacgtcgata acttgatctc cggtgttacc agaggtttcc 360 
gttacaagat gaggttcgtg tacgcccatt ttcccatcaa cgcctccatc ggcggtgacg 420 
gaaagtctat cgagatccgt aacttccttg gcgagaagaa ggtgaggaag gtagagatgt 480 
tggatggtgt aacCattgtt cgaYctgaga aggtgaagga tgagattgtt cttgacggta 540 
acgacatcga gcttgtttca aggtcatgcg ctttgatcaa ccagaaatgt cacgtgaaga 600 

30 agaaggatat caggaagttt cttgatggta tctatgttag cgagaaaagc aagatcgtag 660 
aggaagaatg aatgcccata tcatcgtctt agctcctatt tcttgtcttt tgaatgttag 720 
tttcgtttta tggaccaaat ccaccggtgt tgcaaaactc tggctatccc ttgttccctt 780 
tttgtgttcg agattttatt tacttatgca caagttttgg agacgagct 
(2) INFORMATION FOR SEQ ID NO: 330: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 194 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715963 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Lys Thr lie Leu Ser Ser Glu Thr Met Asp lie Pro Asp Ser Val 
1 ' 5 10 15 

Thr lie Lys Val His Ala Lys Val lie Glu Val Glu Gly Pro Arg Gly 
20 25 30 

50 Lys Leu Val Arg Asp Phe Lys His Leu Asn Leu Asp Phe Gin Leu lie 
35 4 0 4 5 

Lys Asp Pro Glu Thr Gly Lys Lys Lys Leu Lys lie Asp Ser Trp Phe 

50 ~ 55 60 

Gly Thr Arg Lys Thr Ser Ala Ser lie Arg Thr Ala Leu Ser His Val 
55 65 70 75 80 

Asp Asn Leu lie Ser Gly Val Thr Arg Gly Phe Arg Tyr Lys Met Arg 

85 ~ 90 95 

Phe Val Tyr Ala His Phe Pro lie Asn Ala Ser lie Gly Gly Asp Gly 
100 105 110 

60 Lys Ser lie Glu lie Arg Asn Phe Leu Gly Glu Lys Lys Val Arg Lys 
115 120 125 

Val Glu Met Leu Asp Gly Val Thr lie Val Arg Xaa Glu Lys Val Lys 
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130 135 140 

Asp Glu He Val Leu Asp Gly Asn Asp He Glu Leu Val Ser Arg Ser 
145 150 155 160 

Cys Ala Leu lie Asn Gin Lys Cys His Val Lys Lys Lys Asp He Arg 

165 170 175 

Lys Phe Leu Asp Gly He Tyr Val Ser Glu Lys Ser Lys lie Val Glu 
180 185 190 

Glu Glu 



10 (2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..185 

20 (D) OTHER INFORMATION: / Ceres Seq. ID 1715964 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 





Met 


Asp 


He 


Pro 


Asp 


Ser 


Val 


Thr 


He 


Lys 


Val 


His 


Ala 


Lys 


Val 


He 




1 








5 










10 










15 






Glu 


Val 


Glu 


Gly 


Pro 


Arg 


Gly 


Lys 


Leu 


Val 


Arg 


Asp 


Phe 


Lys 


His 


Leu 


25 








20 










25 










30 








Asn 


Leu 


Asp 
35 


Phe 


Gin 


Leu 


He 


Lys 
40 


Asp 


Pro 


Glu 


Thr 


Gly 
45 


Lys 


Lys 


Lys 




Leu 


Lys 
50 


lie 


Asp 


Ser 


Trp 


Phe 
55 


Gly 


Thr 


Arg 


Lys 


Thr 
60 


Ser 


Ala 


Ser 


He 


30 


Arg 
65 


Thr 


Ala 


Leu 


Ser 


His 
70 


Val 


Asp 


Asn 


Leu 


lie 
75 


Ser 


Gly 


Val 


Thr 


Arg 

80 




Gly 


Phe 


Arg 


Tyr 


Lys 
85 


Met 


Arg 


Phe 


Val 


Tyr 
90 


Ala 


His 


Phe 


Pro 


lie 
95 


Asn 




Ala 


Ser 


He 


Gly 


Gly 


Asp 


Gly 


Lys 


Ser 


He 


Glu 


lie 


Arg 


Asn 


Phe 


Leu 


35 








100 










105 










110 








Gly 


Glu 


Lys 
115 


Lys 


Val 


Arg 


Lys 


Val 
120 


Glu 


Met 


Leu 


Asp 


Gly 
125 


Val 


Thr 


lie 




Val 


Arg 
130 


Xaa 


Glu 


Lys 


Val 


Lys 
135 


Asp 


Glu 


He 


Val 


Leu 
140 


Asp 


Gly 


Asn 


Asp 


40 


He 
145 


Glu 


Leu 


Val 


Ser 


Arg 
150 


Ser 


Cys 


Ala 


Leu 


lie 
155 


Asn 


Gin 


Lys 


Cys 


His 
160 




Val 


Lys 


Lys 


Lys 


Asp 
165 


He 


Arg 


Lys 


Phe 


Leu 
170 


Asp 


Gly 


He 


Tyr 


Val 
175 


Ser 




Glu 


Lys 


Ser 


Lys 


He 


Val 


Glu 


Glu 


Glu 
















45 








180 










185 

















(2) INFORMATION FOR SEQ ID NO: 332: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 
50 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
5 5 (B) LOCATION: 1..100 

(D) OTHER INFORMATION: / Ceres Seq. ID 1715965 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 
Met Arg Phe Val Tyr Ala His Phe Pro lie Asn Ala Ser He Gly Gly 
15 10 15 

60 Asp Gly Lys Ser He Glu He Arg Asn Phe Leu Gly Glu Lys Lys Val 

20 25 30 

Arg Lys Val Glu Met Leu Asp Gly Val Thr He Val Arg Xaa Glu Lys 
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35 40 45 

Val Lys Asp Glu lie Val Leu Asp Gly Asn Asp lie Glu Leu Val Ser 

50 ' 55 60 

Arg Ser Cys Ala Leu lie Asn Gin Lys Cys His Val Lys Lys Lys Asp 
65 " 7 0 75 80 

lie Arg Lys Phe Leu Asp Gly lie Tyr Val Ser Glu Lys Ser Lys lie 
85 90 95 

Val Glu Glu Glu 
100 

(2) INFORMATION FOR SEQ ID NO: 333: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
DNA (genomic) 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



(xi) 



1 . . 675 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808584 
SEQUENCE DESCRIPTION: SEQ ID NO: 333: 
aacttgtcac ctgcagagaa agaaagaagc cctagatttt gtcaaaaggc ggttgcagaa 
caaaaaacca tggccggaat tggaccgata actcaggatt gggagccggr ggtgatccgt 
aagaaacccg ctaacgccgc tgccaagcgc gacgagaaaa ctgtcaacqc cgctcgtcga 
tccggcgccg atatcgagac cgtcagaaaa ttcaatgctg gaaccaacaa ggcggcatca 
agcggcacat ctctgaacac aaaaatgctt gatgatgaca ctgagaaccr tactcatgaa 
cgtgtgccta ctgagctaaa gaaagccatt atgcaagcca ggacagacaa gaagctaacc 
cagtcccaac ttgctcaaat catcaatgag aagccacaag tgattcaaga gtatgagtct 
ggcaaagcta tacccaacca gcaaatcctt tctaagctgg agagagcgct tggagctaag 
cttcgtggaa agaagtgagc caagttctac tgatgtagca agtaacaaga atcaatgctt 
tcgtctaatg ccgtaacttt gccaagaaga atattttctg attgtaagaa agcaaaaccg 
tttgaatgtt tgtttcgttg atggaatctc tatctcataa actcatatca atataataac 
ttgggtcttt tcatc 

(2) INFORMATION FOR SEQ ID NO: 334: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE : 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..165 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808585 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:334: 
Asn Leu Ser Pro Ala Glu Lys Glu Arg Ser Pro Arg Phe Cys Gin Lys 
15 10 15 

Ala Val Ala Glu Gin Lys Thr Met Ala Gly lie Gly Pro lie Thr Gin 



(ii) 
(ix) 



20 



25 



30 



Asp Trp Glu Pro Val Val lie Arg Lys Lys Pro Ala Asn Ala Ala Ala 

35 4 0 4 5 

Lys Arg Asp Glu Lys Thr Val Asn Ala Ala Arg Arg Ser Gly Ala Asp 

50 55 60 

lie Glu Thr Val Arg Lys Phe Asn Ala Gly Thr Asn Lys Ala Ala Ser 
65 70 75 80 

Ser Gly Thr Ser Leu Asn Thr Lys Met Leu Asp Asp Asp Thr Glu Asn 

85 90 95 

Leu Thr His Glu Arg Val Pro Thr Glu Leu Lys Lys Ala lie Met Gin 

100 105 110 

Ala Arg Thr Asp Lys Lys Leu Thr Gin Ser Gin Leu Ala Gin lie lie 

115 " 120 125 

Asn Glu Lys Pro Gin Val lie Gin Glu Tyr Glu Ser Gly Lys Ala lie 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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130 135 140 

Pro Asn Gin Gin lie Leu Ser Lys Leu Glu Arg Ala Leu Gly Ala Lys 
145 150 155 160 

Leu Arg Gly Lys Lys 
5 "* 165 

(2) INFORMATION FOR SEQ ID NO: 335: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
15 (B) LOCATION: 1..142 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808586 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
Met Ala Gly He Gly Pro He Thr Gin Asp Trp Glu Pro Val Val He 
15 10 15 

2 0 Arg Lys Lys Pro Ala Asn Ala Ala Ala Lys Arg Asp Glu Lys Thr Val 

20 25 30 

Asn Ala Ala Arg Arg Ser Gly Ala Asp He Glu Thr Val Arg Lys Phe 

35 40 45 

Asn Ala Gly Thr Asn Lys Ala Ala Ser Ser Gly Thr Ser Leu Asn Thr 
25 50 55 60 

Lys Met Leu Asp Asp Asp Thr Glu Asn Leu Thr His Glu Arg Val Pro 
65 70 75 80 

Thr Glu Leu Lys Lys Ala He Met Gin Ala Arg Thr Asp Lys Lys Leu 
85 90 95 

30 Thr Gin Ser Gin Leu Ala Gin He He Asn Glu Lys Pro Gin Val He 

100 105 110 

Gin Glu Tyr Glu Ser Gly Lys Ala He Pro Asn Gin Gin He Leu Ser 

115 ~ 120 125 

Lys Leu Glu Arg Ala Leu Gly Ala Lys Leu Arg Gly Lys Lys 
35 " 130 135 140 

(2) INFORMATION FOR SEQ ID NO: 336: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

4 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
4 5 (B) LOCATION: 1 . . 630 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808591 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:336: 
aacactacaa caattttcat acctcatatt ctcttccatg gcaaccacca agcacctcct 60 
cctcttcctc ctcatcatcg ctatcatcac cattacatct tccacttcac ttccatttct 120 
50 cacgacggag caaaaccaaa tcgcaaccaa aatcatagac gcaatggtct caagtggctc 180 
tttcgaagat tggagcggag cgttcctcaa caacaacgac gaattaaacg gtccagtcct 240 
aacttcaact ctcttcctcc ccaaaacatc cgtggaagga atcaacgcca cgtcaccact 300 
agttgcttct taccatattg ttccacaatg gcttgacttc tccgtcataa gtctcatgat 360 
gcctttctct cgcatcccta cactcctctc tggacactct atagtcgtta ccaacaattc 420 
55 agcttctggt tttactcttg atggtgttct catctctgag ccagatttat tcgtctctcc 480 
tactatagtt atccatcgaa tggcttttcc atttaacttc tctcgttacg gtggtggcga 540 
tatatagtta tttacttttg attcgcttct ccactttgtt tctttctacr tttactaata 600 
cactttcaaa taattaacaa cgaacttgac 
(2) INFORMATION FOR SEQ ID NO: 337: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 181 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808592 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 
Thr Leu Gin Gin Phe Ser Tyr Leu lie Phe Ser Ser Met Ala Thr Thr 
10 1 5 10 15 

Lys His Leu Leu Leu Phe Leu Leu lie lie Ala lie lie Thr lie Thr 

20 25 30 

Ser Ser Thr Ser Leu Pro Phe Leu Thr Thr Glu Gin Asn Gin lie Ala 
35 40 45 

15 Thr Lys lie lie Asp Ala Met Val Ser Ser Gly Ser Phe Glu Asp Trp 
50 5 5 60 

Ser Gly Ala Phe Leu Asn Asn Asn Asp Glu Leu Asn Gly Pro Val Leu 
65 70 • 75 80 

Thr Ser Thr Leu Phe Leu Pro Lys Thr Ser Val Glu Gly lie Asn Ala 
20 85 90 95 

Thr Ser Pro Leu Val Ala Ser Tyr His lie Val Pro Gin Trp Leu Asp 

100 105 110 

Phe Ser Val lie Ser Leu Met Met Pro Phe Ser Arg He Pro Thr Leu 
115 120 125 

25 Leu Ser Gly His Ser He Val Val Thr Asn Asn Ser Ala Ser Gly Phe 
130 135 140 

Thr Leu Asp Gly Val Leu lie Ser Glu Pro Asp Leu Phe Val Ser Pro 
145 150 155 160 

Thr He Val He His Arg Met Ala Phe Pro Phe Asn Phe Ser Arg Tyr 
30 165 170 175 

Gly Gly Gly Asp He 
180 

(2) INFORMATION FOR SEQ ID NO- 338: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..169 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808593 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
45 Met Ala Thr Thr Lys His Leu Leu Leu Phe Leu Leu He lie Ala He 
15 10 15 

He Thr He Thr Ser Ser Thr Ser Leu Pro Phe Leu Thr Thr Glu Gin 

20 25 30 

Asn Gin He Ala Thr Lys He He Asp Ala Met Val Ser Ser Gly Ser 
50 35 ' 40 45 

Phe Glu Asp Trp Ser Gly Ala Phe Leu Asn Asn Asn Asp Glu Leu Asn 

50 55 60 

Gly Pro Val Leu Thr Ser Thr Leu Phe Leu Pro Lys Thr Ser Val Glu 
65 70 75 80 

55 Gly He Asn Ala Thr Ser Pro Leu Val Ala Ser Tyr His He Val Pro 

85 90 95 

Gin Trp Leu Asp Phe Ser Val He Ser Leu Met Met Pro Phe Ser Arg 

100 105 110 

He Pro Thr Leu Leu Ser Gly His Ser He Val Val Thr Asn Asn Ser 
60 115 120 125 

Ala Ser Gly Phe Thr Leu Asp Gly Val Leu He Ser Glu Pro Asp Leu 
130 135 140 
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Phe Val Ser Pro Thr He Val He His Arg Met Ala Phe Pro Phe Asn 
145 150 155 160 

Phe Ser Arg Tyr Gly Gly Gly Asp He 
165 

(2) INFORMATION FOR SEQ ID NO:339: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) • MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..127 

(D) OTHER INFORMATION: / Ceres Seq. ID 1808594 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 



Met 


Val 


Ser 


Ser 


Gly 


Ser 


Phe 


Glu 


Asp 


Trp 


Ser 


Gly 


Ala 


Phe 


Leu 


Asn 


1 








5 










10 










15 




Asn 


Asn 


Asp 


Glu 
20 


Leu 


Asn 


Gly 


Pro 


Val 
25 


Leu 


Thr 


Ser 


Thr 


Leu 
30 


Phe 


Leu 


Pro 


Lys 


Thr 
35 


Ser 


Val 


Glu 


Gly 


He 
4 0 


Asn 


Ala 


Thr 


Ser 


Pro 
45 


Leu 


Val 


Ala 


Ser 


Tyr 
50 


His 


He 


Val 


Pro 


Gin 
55 


Trp 


Leu 


Asp 


Phe 


Ser 
60 


Val 


He 


Ser 


Leu 


Met 


Met 


Pro 


Phe 


Ser 


Arg 


He 


Pro 


Thr 


Leu 


Leu 


Ser 


Gly 


His 


Ser 


He 


65 










70 










75 










80 


Val 


Val 


Thr 


Asn 


Asn 
85 


Ser 


Ala 


Ser 


Gly 


Phe 
90 


Thr 


Leu 


Asp 


Gly 


Val 
95 


Leu 


He 


Ser 


Glu 


Pro 
100 


Asp 


Leu 


Phe 


Val 


Ser 
105 


Pro 


Thr 


He 


Val 


He 
110 


His 


Arg 


Met 


Ala 


Phe 
115 


Pro 


Phe 


Asn 


Phe 


Ser 
120 


Arg 


Tyr 


Gly 


Gly 


Gly 
125 


Asp 


He 





(2) INFORMATION FOR SEQ ID NO: 340: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .717 

(D) OTHER INFORMATION: / Ceres Seq. ID 1920563 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



cttgtttata ctaatgacca aggagagact gttaaaggag tttgctcaaa tttcttgtgt 60 

gatttggcac ccggaagtga tgttaagctg actggtcctg taggcaaaga aatgcttatg 120 

cctaaggatc caaacgccac cgtaattatg cttgccacag ggacaggaat tgctcctttc 180 

aggtctttct tatggaagat gttctttgag aaacatgatg actacaagtt caatggctta 240 

gcttggttgt tcttgggtgt accaaccact agctcattgc tctaccaaga ggagtttgat 300 

aagatgaaag caaaggcccc cgagaacttc agggtggatt acgcgataag cagagaacaa 360 

gcgaacgata aaggagagaa aatgtatatc cagactcgga tggcacagta cgcagctgaa 420 

ttatgggagt tgttgaagaa agacaacact tttgtttaca tgtgtggact caagggaatg 480 

gagaaaggaa ttgatgacat tatggtctca ttggctgcaa atgacggtat tgactggttt 540 

gattacaaga agcagttgaa gaaggcagag caatggaacg ttgaagtcta ctgatcaaaa 600 

agcctttgac atttctgtag caaagtatag ctgaacaaaa ctgtaatttt cgcttccgaa 660 



tttctgtatt ttgaagataa gttttttaga tatgttatac taaaaaaaga gttcttt 
(2) INFORMATION FOR SEQ ID NO: 341: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE : 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..197 

5 (D) OTHER INFORMATION: / Ceres Seq. ID 1920564 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:341: 
Leu Val Tyr Thr Asn Asp Gin Gly Glu Thr Val Lys Gly Val Cys Ser 
15 10 15 

Asn Phe Leu Cys Asp Leu Ala Pro Gly Ser Asp Val Lys Leu Thr Gly 
10 20 25 ^ 30 

Pro Val Gly Lys Glu Met Leu Met Pro Lys Asp Pro Asn Ala Thr Val 

35 40 45 

lie Met Leu Ala Thr Gly Thr Gly lie Ala Pro Phe Arg Ser Phe Leu 
50 55 60 

15 Trp Lys Met Phe Phe Glu Lys His Asp Asp Tyr Lys Phe Asn Gly Leu 
65 7 0 7 5 80 

Ala Trp Leu Phe Leu Gly Val Pro Thr Thr Ser Ser Leu Leu Tyr Gin 

8 5 90 95 

Glu Glu Phe Asp Lys Met Lys Ala Lys Ala Pro Glu Asn Phe Arg Val 
20 100 105 110 

Asp Tyr Ala lie Ser Arg Glu Gin Ala Asn Asp Lys Gly Glu Lys Met 

115 120 125 

Tyr lie Gin Thr Arg Met Ala Gin Tyr Ala Ala Glu Leu Trp Glu Leu 
130 135 140 

2 5 Leu Lys Lys Asp Asn Thr Phe Val Tyr Met Cys Gly Leu Lys Gly Met 
145 150 155 ' 160 

Glu Lys Gly lie Asp Asp lie Met Val Ser Leu Ala Ala Asn Asp Gly 

165 17 0 175 

lie Asp Trp Phe Asp Tyr Lys Lys Gin Leu Lys Lys Ala Glu Gin Trp 
30 180 * 185 190 

Asn Val Glu Val Tyr 
195 

(2) INFORMATION FOR SEQ ID NO: 342: 
(i) SEQJENCE CHARACTERISTICS: 
35 (A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..1 60 

(D) OTHER INFORMATION: / Ceres Seq. ID 1920565 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
4 5 Met Leu Met Pro Lys Asp Pro Asn Ala Thr Val lie Met Leu Ala Thr 
15 10 15 

Gly Thr Gly lie Ala Pro Phe Arg Ser Phe Leu Trp Lys Met Phe Phe 

20 25 30 

Glu Lys His Asp Asp Tyr Lys Phe Asn Gly Leu Ala Trp Leu Phe Leu 
50 35 40 45 

Gly Val Pro Thr Thr Ser Ser Leu Leu Tyr Gin Glu Glu Phe Asp Lys 

50 55 60 

Met Lys Ala Lys Ala Pro Glu Asn Phe Arg Val Asp Tyr Ala lie Ser 
65 70 75 80 

55 Arg Glu Gin Ala Asn Asp Lys Gly Glu Lys Met Tyr lie Gin Thr Arg 

85 90 95 

Met Ala Gin Tyr Ala Ala Glu Leu Trp Glu Leu Leu Lys Lys Asp Asn 

100 105 110 

Thr Phe Val Tyr Met Cys Gly Leu Lys Gly Met Glu Lys Gly lie Asp 
60 115 120 125 

Asp lie Met Val Ser Leu Ala Ala Asn Asp Gly He Asp Trp Phe Asp 
130 135 140 
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Tyr Lys Lys Gin Leu Lys Lys Ala Glu Gin Trp Asn Val Glu Val Tyr 
145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 343: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..158 

(D) OTHER INFORMATION: / Ceres Seq. ID 1920566 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343: 



Met 


Pro 


Lys 


Asp 


Pro 


Asn 


Ala 


Thr 


Val 


He 


Met 


Leu 


Ala 


Thr 


Gly 


Thr 


1 








5 










10 










15 




Gly 


He 


Ala 


Pro 


Phe 


Arg 


Ser 


Phe 


Leu 


Trp 


Lys 


Met 


Phe 


Phe 


Glu 


Lys 








20 










25 










30 






His 


Asp 


Asp 


Tyr 


Lys 


Phe 


Asn 


Gly 


Leu 


Ala 


Trp 


Leu 


Phe 


Leu 


Gly 


Val 






35 










40 










45 








Pro 


Thr 


Thr 


Ser 


Ser 


Leu 


Leu 


Tyr 


Gin 


Glu 


Glu 


Phe 


Asp 


Lys 


Met 


Lys 




50 










55 










60 










Ala 


Lys 


Ala 


Pro 


Glu 


Asn 


Phe 


Arg 


Val 


Asp 


Tyr 


Ala 


He 


Ser 


Arg 


Glu 


65 










70 










75 










80 


Gin 


Ala 


Asn 


Asp 


Lys 


Gly 


Glu 


Lys 


Met 


Tyr 


He 


Gin 


Thr 


Arg 


Met 


Ala 










85 










90 










95 




Gin 


Tyr 


Ala 


Ala 


Glu 


Leu 


Trp 


Glu 


Leu 


Leu 


Lys 


Lys 


Asp 


Asn 


Thr 


Phe 








100 










105 










110 






Val 


Tyr 


Met 


Cys 


Gly 


Leu 


Lys 


Gly 


Met 


Glu 


Lys 


Gly 


He 


Asp 


Asp 


lie 






115 










120 










125 








Met 


Val 


Ser 


Leu 


Ala 


Ala 


Asn 


Asp 


Gly 


He 


Asp 


Trp 


Phe 


Asp 


Tyr 


Lys 




130 










135 




















Lys 


Gin 


Leu 


Lys 


Lys 


Ala 


Glu 


Gin 


Trp 


Asn 


Val 


Gl, 


Val 


Tyr 







145 150 155 



(2) INFORMATION FOR SEQ ID NO: 344: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .2192 

(D) OTHER INFORMATION: / Ceres Seq. ID 1974419 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:344: 



aattcaagtc atctctatcc gtcaacaata caaaccaacc ttctcaattc ctctcttttc 60 

atccacattt tcactagcta atggcttcta gttcaatgag cacctcatca tggacagcta 120 

gagaggacaa gcaattcgaa atggcgttgg cgaaattcga caaggacact cctgaccgtt 180 

ggcaaaaaat tgcaagggca gttggtggga aatcaactga agaagtaaag cgacactatg 240 

aattgctcct tagggatgtg aatgacattg agtcaggacg ctatccacaa cctagattat 300 

gttccgcggc ggcgatagct gctgcgttca cttcaatgtc gatgtcacag aaccgcgctt 360 

acgctgactc gcgatttcgc tttcctttct tctcttcttc acctccggca gaagaatctc 420 

caaccgatca taagtcttct tcgaattcta aatcggagac taaacctgac tccgatgagc 480 

ctaaaggatc tggtttcgat cctgagtcat tggaaagagg tgctaaagct cttcgtgaaa 54 0 

tcaatagctc tcctcattcc aaacaggtgt ttgatctaat gcgaaagcag gagaagactc 600 

gattagctga attagcggct gagaaagaac ataatgaagc tattcaagct agcaaggaca 660 

ttgaaagaca gcggaaattg gcagaggatc agagaaattt agtgcagcaa caggcccaag 720 

cgaaagcgca aaatcttaga tatgaggatg agttggccag gaagagaatg cagacagata 780 

atgaagctca gagacggcat aatgctgaat tggtttcgat gcaagaagca tcttctatac 840 

ggaaagagaa agcaagaatt gccacggaag aacagattca agcacagcag cgcgagactg 900 

agaaagagag agcggaactt gagcgagaaa caattcgcgt gaaagctatg gctgaggctg 960 
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aaggccgagc tcatgaagct aaactcactg aggagcagaa 
agataaatgg tgaaagggag aaatggcttg cagcaatcaa 
aaggaggagt aaggacctta ttaactgatc gaagtaaatt 
ttacggcatt agccgctggg gtttacacaa ctcgagaagg 
5 atatcaatag aattcttggg cagccatcac tgatcagaga 
catgggcagg ctcagtgtct cagtttaaga acaaacttag 
catctgcaga aggagaaaag cctcttgaaa atgtaattct 
gaattgagcg tcttgcaaga gccacagcaa ataccaagtc 
acatgatgtt ttatgggcct ccaggtaccg gaaaaactat 

10 ggaagtcggg tcttgattat gctatgatga caggaggaga 
aggctgttac aaagatccat gaaatatttg attgggctaa 
tgcttttcat cgatgaagct gatgctttcc tatgcgaacg 
aggctcagcg cagcgctctg aacgcgttgc tctttcgaac 
tagtccttgt cctggctaca aacagacctg gagatctcga 

15 tcgacgaagt tatcgagttt cctctccctg gtgaagaaga 
tctatctcaa caagtaccta atgggtgacg acaagaaagg 
aatggagcaa cttgttcaag aagaagaagt cacagaagat 
ccgaccaagt gattaaagaa gctgcaaaaa agacagaagg 
ctaagcttgt cgccggtgtt caagctgcgg tatatggacg 

20 cccaactttt tgaagaaatt gtggattata agatcgaaga 
ttgcgactga aggtggccaa tcgtttccgt ag 
(2) INFORMATION FOR SEQ ID NO: 345: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 703 amino acids 

25 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

30 (A) NAME/KEY: peptide 



(B) LOCATION: 1..703 

(D) OTHER INFORMATION: / Ceres Seq. ID 1974420 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 



Met 


Ala 


Ser 


Ser 


Ser 


Met 


Ser 


Thr 


Ser 


Ser 


Trp 


Thr 


Ala 


Arg 


Glu 


Asp 


1 








5 










10 










15 




Lys 


Gin 


Phe 


Glu 


Met 


Ala 


Leu 


Ala 


Lys 


Phe 


Asp 


Lys 


Asp 


Thr 


Pro 


Asp 








20 










25 










30 






Arg 


Trp 


Gin 


Lys 


He 


Ala 


Arg 


Ala 


Val 


Gly 


Gly 


Lys 


Ser 


Thr 


Glu 


Glu 






35 










40 










45 








Val 


Lys 


Arg 


His 


Tyr 


Glu 


Leu 


Leu 


Leu 


Arg 


Asp 


Val 


Asn 


Asp 


He 


Glu 




50 










55 










60 










Ser 


Gly 


Arg 


Tyr 


Pro 


Gin 


Pro 


Arg 


Leu 


Cys 


Ser 


Ala 


Ala 


Ala 


He 


Ala 


65 










70 










75 










80 


Ala 


Ala 


Phe 


Thr 


Ser 


Met 


Ser 


Met 


Ser 


Gin 


Asn 


Arg 


Ala 


Tyr 


Ala 


Asp 










85 










90 










95 




Ser 


Arg 


Phe 


Arg 


Phe 


Pro 


Phe 


Phe 


Ser 


Ser 


Ser 


Pro 


Pro 


Ala 


Glu 


Glu 








100 










105 










110 






Ser 


Pro 


Thr 


Asp 


His 


Lys 


Ser 


Ser 


Ser 


Asn 


Ser 


Lys 


Ser 


Glu 


Thr 


Lys 






115 










120 










125 








Pro 


Asp 


Ser 


Asp 


Glu 


Pro 


Lys 


Gly 


Ser 


Gly 


Phe 


Asp 


Pro 


Glu 


Ser 


Leu 




130 










135 










140 










Glu 


Arg 


Gly 


Ala 


Lys 


Ala 


Leu 


Arg 


Glu 


He 


Asn 


Ser 


Ser 


Pro 


His 


Ser 


145 










150 










155 










160 


Lys 


Gin 


Val 


Phe 


Asp 


Leu 


Met 


Arg 


Lys 


Gin 


Glu 


Lys 


Thr 


Arg 


Leu 


Ala 










165 










170 










175 




Glu 


Leu 


Ala 


Ala 


Glu 


Lys 


Glu 


His 


Asn 


Glu 


Ala 


He 


Gin 


Ala 


Ser 


Lys 








180 










185 










190 






Asp 


He 


Glu 


Arg 


Gin 


Arg 


Lys 


Leu 


Ala 


Glu 


Asp 


Gin 


Arg 


Asn 


Leu 


Val 






195 










200 










205 








Gin 


Gin 


Gin 


Ala 


Gin 


Ala 


Lys 


Ala 


Gin 


Asn 


Leu 


Arg 


Tyr 


Glu 


Asp 


Glu 




210 










215 










220 










Leu 


Ala 


Arg 


Lys 


Arg 


Met 


Gin 


Thr 


Asp 


Asn 


Glu 


Ala 


Gin 


Arg 


Arg 


His 



tagaagaatg 
cacgactttc 
gattatgact 
tgctagggtt 
atcttccatg 
cacagctgca 
ccatcgttct 
acataaagca 
ggtggcgagg 
tgttgctcct 
gaaat caaac 
taacagcact 
cggtgatcaa 
cagtgcagtc 
acgcttcaag 
t gagaaagac 
aaccattgaa 
cttttctggt 
acaggattgt 
acatcaccag 



cttctagata 
agtcacattg 
gttggaggag 
acctggggtt 
ggccgattcc 
ggggcagcag 
ttgaagacga 
ccattccgaa 
gagattgctc 
cttggtgcac 
aaagggttac 
tacatgagtg 
tcgcgggaca 
actgacagga 
ctcct caagc 
t caaacctta 
ggagacctaa 
cgtgaaatcg 
gtcttggatt 
agaatcagac 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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225 230 235 240 

Asn Ala Glu Leu Val Ser Met Gin Glu Ala Ser Ser lie Arg Lys Glu 

245 250 255 

Lys Ala Arg lie Ala Thr Glu Glu Gin lie Gin Ala Gin Gin Arg Glu 
5 260 265 270 

Thr Glu Lys Glu Arg Ala Glu Leu Glu Arg Glu Thr lie Arg Val Lys 

275 280 285 

Ala Met Ala Glu Ala Glu Gly Arg Ala His Glu Ala Lys Leu Thr Glu 
290 295 300 

10 Glu Gin Asn Arg Arg Met Leu Leu Asp Lys lie Asn Gly Glu Arg Glu 
305 310 315 320 

Lys Trp Leu Ala Ala lie Asn Thr Thr Phe Ser His lie Glu Gly Gly 

325 330 335 

Val Arg Thr Leu Leu Thr Asp Arg Ser Lys Leu lie Met Thr Val Gly 
15 34 0 34 5 350 

Gly Val Thr Ala Leu Ala Ala Gly Val Tyr Thr Thr Arg Glu Gly Ala 

355 360 365 

Arg Val Thr Trp Gly Tyr lie Asn Arg lie Leu Gly Gin Pro Ser Leu 
370 375 380 

20 lie Arg Glu Ser Ser Met Gly Arg Phe Pro Trp Ala Gly Ser Val Ser 
385 390 395 400 

Gin Phe Lys Asn Lys Leu Ser Thr Ala Ala Gly Ala Ala Ala Ser Ala 

4 05 410 ' 415 

Glu Gly Glu Lys Pro Leu Glu Asn Val lie Leu His Arg Ser Leu Lys 
25 ^ 420 425 430 

Thr Arg lie Glu Arg Leu Ala Arg Ala Thr Ala Asn Thr Lys Ser His 

435 440 445 

Lys Ala Pro Phe Arg Asn Met Met Phe Tyr Gly Pro Pro Gly Thr Gly 
450 455 460 

30 Lys Thr Met Val Ala Arg Glu lie Ala Arg Lys Ser Gly Leu Asp Tyr 
465 470 475 480 

Ala Met Met Thr Gly Gly Asp w al Ala Pro Leu Gly Ala Gin Ala Val 

485 490 495 

Thr Lys lie His Glu lie Phe Asp Trp Ala Lys Lys Ser Asn Lys Gly 

3 5 ' 500 505 510 

Leu Leu Leu Phe lie Asp Glu Ala Asp Ala Phe Leu Cys Glu Arg Asn 

515 520 525 

Ser Thr Tyr Met Ser Glu Ala Gin Arg Ser Ala Leu Asn Ala Leu Leu 
530 535 540 

4 0 Phe Arg Thr Gly Asp Gin Ser Arg Asp He Val Leu Val Leu Ala Thr 

545 550 555 560 

Asn Arg Pro Gly Asp Leu Asp Ser Ala Val Thr Asp Arg He Asp Glu 

565 570 575 

Val He Glu Phe Pro Leu Pro Gly Glu Glu Glu Arg Phe Lys Leu Leu 
45 580 585 590 

Lys Leu Tyr Leu Asn Lys Tyr Leu Met Gly Asp Asp Lys Lys Gly Glu 

595 600 605 

Lys Asp Ser Asn Leu Lys Trp Ser Asn Leu Phe Lys Lys Lys Lys Ser 
610 ' 615 620 

50 Gin Lys He Thr He Glu Gly Asp Leu Thr Asp Gin Val He Lys Glu 
625 ' 630 635 640 

Ala Ala Lys Lys Thr Glu Gly Phe Ser Gly Arg Glu He Ala Lys Leu 

645 650 655 

Val Ala Gly Val Gin Ala Ala Val Tyr Gly Arg Gin Asp Cys Val Leu 
55 660 665 670 

Asp Ser Gin Leu Phe Glu Glu He Val Asp Tyr Lys He Glu Glu His 

675 680 685 

His Gin Arg He Arg Leu Ala Thr Glu Gly Gly Gin Ser Phe Pro 
690 695 700 

60 (2) INFORMATION FOR SEQ ID NO: 34 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 698 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. . 698 

( D) OTHER INFORMATION: / Ceres Seq. ID 1974421 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
10 Met Ser Thr Ser Ser Trp Thr Ala Arg Glu Asp Lys Gin Phe Glu Met 
15 10 15 

Ala Leu Ala Lys Phe Asp Lys Asp Thr Pro Asp Arg Trp Gin Lys lie 

20 ~ " 25 30 

Ala Arg Ala Val Gly Gly Lys Ser Thr Glu Glu Val Lys Arg His Tyr 
15 35 J 40 4 5 

Glu Leu Leu Leu Arg Asp Val Asn Asp lie Glu Ser Gly Arg Tyr Pro 

50 55 60 

Gin Pro Arg Leu Cys Ser Ala Ala Ala lie Ala Ala Ala Phe Thr Ser 
65 70 75 80 

2 0 Met Ser Met Ser Gin Asn Arg Ala Tyr Ala Asp Ser Arg Phe Arg Phe 

85 " 90 95 

Pro Phe Phe Ser Ser Ser Pro Pro Ala Glu Glu Ser Pro Thr Asp His 

100 105 110 

Lys Ser Ser Ser Asn Ser Lys Ser Glu Thr Lys Pro Asp Ser Asp Glu 
25 115 * 120 ~ 125 

Pro Lys Gly Ser Gly Phe Asp Pro Glu Ser Leu Glu Arg Gly Ala Lys 

130 135 140 

Ala Leu Arg Glu lie Asn Ser Ser Pro His Ser Lys Gin Val Phe Asp 
145 " 150 155 ^ 160 

3 0 Leu Met Arg Lys Gin Glu Lys Thr Arg Leu Ala Glu Leu Ala Ala Glu 

165 170 175 

Lys Glu His Asn Glu Ala lie Gin Ala Ser Lys Asp lie Glu Arg Gin 

180 185 " ^ 190 

Arg Lys Leu Ala Glu Asp Gin Arg Asn Leu Val Gin Gin Gin Ala Gin 
35 195 200 205 

Ala Lys Ala Gin Asn Leu Arg Tyr Glu Asp Glu Leu Ala Arg Lys Arg 

210 215 220 

Met Gin Thr Asp Asn Glu Ala Gin Arg Arg His Asn Ala Glu Leu Val 
225 230 235 240 

4 0 Ser Met Gin Glu Ala Ser Ser lie Arg Lys Glu Lys Ala Arg lie Ala 

245 250 255 

Thr Glu Glu Gin lie Gin Ala Gin Gin Arg Glu Thr Glu Lys Glu Arg 

260 265 270 

Ala Glu Leu Glu Arg Glu Thr lie Arg Val Lys Ala Met Ala Glu Ala 
45 275 280 285 

Glu Gly Arg Ala His Glu Ala Lys Leu Thr Glu Glu Gin Asn Arg Arg 

290 295 300 

Met Leu Leu Asp Lys lie Asn Gly Glu Arg Glu Lys Trp Leu Ala Ala 
305 ~ ~ 310 315 320 

5 0 lie Asn Thr Thr Phe Ser His lie Glu Gly Gly Val Arg Thr Leu Leu 

325 330 335 

Thr Asp Arg Ser Lys Leu lie Met Thr Val Gly Gly Val Thr Ala Leu 

340 345 350 

Ala Ala Gly Val Tyr Thr Thr Arg Glu Gly Ala Arg Val Thr Trp Gly 
55 355 360 365 

Tyr lie Asn Arg lie Leu Gly Gin Pro Ser Leu lie Arg Glu Ser Ser 

370 375 380 

Met Gly Arg Phe Pro Trp Ala Gly Ser Val Ser Gin Phe Lys Asn Lys 
385 ' 390 395 400 

60 Leu Ser Thr Ala Ala Gly Ala Ala Ala Ser Ala Glu Gly Glu Lys Pro 

405 410 415 

Leu Glu Asn Val lie Leu His Arg Ser Leu Lys Thr Arg lie Glu Arg 
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420 425 430 

Leu Ala Arg Ala Thr Ala Asn Thr Lys Ser His Lys Ala Pro Phe Arg 

435 440 445 

Asn Met Met Phe Tyr Gly Pro Pro Gly Thr Gly Lys Thr Met Val Ala 

5 450 455 460 

Arg Glu lie Ala Arg Lys Ser Gly Leu Asp Tyr Ala Met Met Thr Gly 
465 470 475 480 

Gly Asp Val Ala Pro Leu Gly Ala Gin Ala Val Thr Lys lie His Glu 
485 490 495 

10 lie Phe Asp Trp Ala Lys Lys Ser Asn Lys Gly Leu Leu Leu Phe lie 

500 505 510 

Asp Glu Ala Asp Ala Phe Leu Cys Glu Arg Asn Ser Thr Tyr Met Ser 

515 520 525 

Glu Ala Gin Arg Ser Ala Leu Asn Ala Leu Leu Phe Arg Thr Gly Asp 

15 530 535 540 

Gin Ser Arg Asp lie Val Leu Val Leu Ala Thr Asn Arg Pro Gly Asp 
545 550 555 560 

Leu Asp Ser Ala Val Thr Asp Arg lie Asp Glu Val lie Glu Phe Pro 
565 570 575 

2 0 Leu Pro Gly Glu Glu Glu Arg Phe Lys Leu Leu Lys Leu Tyr Leu Asn 

580 585 590 

Lys Tyr Leu Met Gly Asp Asp Lys Lys Gly Glu Lys Asp Ser Asn Leu 

595 "* 600 605 

Lys Trp Ser Asn Leu Phe Lys Lys Lys Lys Ser Gin Lys lie Thr lie 

25 610 615 620 

Glu Gly Asp Leu Thr Asp Gin Val lie Lys Glu Ala Ala Lys Lys Thr 
625 630 635 640 

Glu Gly Phe Ser Gly Arg Glu lie Ala Lys Leu Val Ala Gly Val Gin 
645 650 655 

30 Ala Ala Val Tyr Gly Arg Gin Asp Cys Val Leu Asp Ser Gin Leu Phe 

660 665 670 

Glu Glu lie Val Asp Tyr Lys lie Glu Glu His His Gin Arg lie Arg 

67 5 680 63 5 

Leu Ala Thr Glu Gly Gly Gin Ser Phe Pro 

35 690 695 

(2) INFORMATION FOR SEQ ID NO: 347: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 amino acids 

(B) TYPE: amino acid 
4 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

4 5 (B) LOCATION: 1..68 3 

(D) OTHER INFORMATION: / Ceres Seq. ID 1974422 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

Met Ala Leu Ala Lys Phe Asp Lys Asp Thr Pro Asp Arg Trp Gin Lys 
1 5 10 15 

50 lie Ala Arg Ala Val Gly Gly Lys Ser Thr Glu Glu Val Lys Arg His 

20 25 30 

Tyr Glu Leu Leu Leu Arg Asp Val Asn Asp lie Glu Ser Gly Arg Tvr 

35 40 45 

Pro Gin Pro Arg Leu Cys Ser Ala Ala Ala lie Ala Ala Ala Phe Thr 

55 50 55 60 

Ser Met Ser Met Ser Gin Asn Arg Ala Tyr Ala Asp Ser Arg Phe Arg 
65 70 75 80 

Phe Pro Phe Phe Ser Ser Ser Pro Pro Ala Glu Glu Ser Pro Thr Asp 
85 90 95 

60 His Lys Ser Ser Ser Asn Ser Lys Ser Glu Thr Lys Pro Asp Ser Asp 

100 105 110 

Glu Pro Lys Gly Ser Gly Phe Asp Pro Glu Ser Leu Glu Arg Gly Ala 
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115 120 125 

Lys Ala Leu Arg Glu lie Asn Ser Ser Pro His Ser Lys Gin Val Phe 

130 135 140 

Asp Leu Met Arg Lys Gin Glu Lys Thr Arg Leu Ala Glu Leu Ala Ala 
5 145 150 155 160 

Glu Lys Glu His Asn Glu Ala lie Gin Ala Ser Lys Asp lie Glu Arg 

165 170 175 

Gin Arg Lys Leu Ala Glu Asp Gin Arg Asn Leu Val Gin Gin Gin Ala 
180 185 190 

10 Gin Ala Lys Ala Gin Asn Leu A_rg Tyr Glu Asp Glu Leu Ala Arg Lys 
195 200 205 

Arg Met Gin Thr Asp Asn Glu Ala Gin Arg Arg His Asn Ala Glu Leu 

210 215 220 

Val Ser Met Gin Glu Ala Ser Ser lie Arg Lys Glu Lys Ala Arg lie 
15 225 230 235 240 

Ala Thr Glu Glu Gin lie Gin Ala Gin Gin Arg Glu Thr Glu Lys Glu 

245 250 255 

Arg Ala Glu Leu Glu Arg Glu Thr lie Arg Val Lys Ala Met Ala Glu 
260 265 270 

2 0 Ala Glu Gly Arg Ala His Glu Ala Lys Leu Thr Glu Glu Gin Asn Arg 
275 280 285 

Arg Met Leu Leu Asp Lys lie Asn Gly Glu Arg Glu Lys Trp Leu Ala 

290 295 300 

Ala lie Asn Thr Thr Phe Ser His lie Glu Gly Gly Val Arg Thr Leu 
25 305 310 315 320 

Leu Thr Asp Arg Ser Lys Leu lie Met Thr Val Gly Gly Val Thr Ala 

325 330 335 

Leu Ala Ala Gly Val Tyr Thr Thr Arg Glu Gly Ala Arg Val Thr Trp 
340 345 350 

30 Gly Tyr lie Asn Arg lie Leu Gly Gin Pro Ser Leu lie Arg Glu Ser 
355 360 365 

Ser Met Gly Arg Phe Pro Trp Ala Gly Ser Val Ser Gin Phe Lys Asn 

370 375 380 

Lys Leu Ser Thr Ala Ala Gly Ala Ala Ala Ser Ala Glu Gly Glu Lys 
35 385 390 " 395 " 400 

Pro Leu Glu Asn Val lie Leu His Arg Ser Leu Lys Thr Arg lie Glu 

405 410 415 

Arg Leu Ala Arg Ala Thr Ala Asn Thr Lys Ser His Lys Ala Pro Phe 
420 425 430 

4 0 Arg Asn Met Met Phe Tyr Gly Pro Pro Gly Thr Gly Lys Thr Met Val 
435 440 445 

Ala Arg Glu lie Ala Arg Lys Ser Gly Leu Asp Tyr Ala Met Met Thr 

450 455 460 

Gly Gly Asp Val Ala Pro Leu Gly Ala Gin Ala Val Thr Lys lie His 
45 465 470 475 480 

Glu lie Phe Asp Trp Ala Lys Lys Ser Asn Lys Gly Leu Leu Leu Phe 

485 490 495 

lie Asp Glu Ala Asp Ala Phe Leu Cys Glu Arg Asn Ser Thr Tyr Met 
500 505 510 

50 Ser Glu Ala Gin Arg Ser Ala Leu Asn Ala Leu Leu Phe Arg Thr Gly 
515 520 525 

Asp Gin Ser Arg Asp lie Val Leu Val Leu Ala Thr Asn Arg Pro Gly 

530 535 540 

Asp Leu Asp Ser Ala Val Thr Asp Arg lie Asp Glu Val lie Glu Phe 
55 545 550 555 560 

Pro Leu Pro Gly Glu Glu Glu Arg Phe Lys Leu Leu Lys Leu Tyr Leu 

565 570 575 

Asn Lys Tyr Leu Met Gly Asp Asp Lys Lys Gly Glu Lys Asp Ser Asn 
580 585 590 

60 Leu Lys Trp Ser Asn Leu Phe Lys Lys Lys Lys Ser Gin Lys lie Thr 
595 600 605 

lie Glu Gly Asp Leu Thr Asp Gin Val lie Lys Glu Ala Ala Lys Lys 
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610 615 620 

Thr Glu Gly Phe Ser Gly Arg Glu lie Ala Lys Leu Val Ala Gly Val 
625 630 635 640 

Gin Ala Ala Val Tyr Gly Arg Gin Asp Cys Val Leu Asp Ser Gin Leu 
5 645 650 655 

Phe Glu Glu lie Val Asp Tyr Lys lie Glu Glu His His Gin Arg lie 

660 ' 665 670 

Arg Leu Ala Thr Glu Gly Gly Gin Ser Phe Pro 
675 680 
10 (2) INFORMATION FOR SEQ ID NO: 34 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 953 

20 (D) OTHER INFORMATION: / Ceres Seq. ID 1975983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 
aaaaaagata aaacaaaagc tgaaaggagg aaaccagagg aaggagacga cgaccaccac 
cacaacgcat ctcagctctt ccttaatcca tggcagacaa gggccgcccg ttgccgaagt 
ttggaagagc tgagatgcgt tgtggtggtg gcgtcgcctc cttcttcaat ctcatcgcaa 

25 tcatgatcat atccgagact aatcgccgtg agatctccaa gtacctcttc aaagagggtg 
ttttgtttgc caaaaaggat ttcaatttac cacaacatcc tttgattgag agtgttccaa 
atetgeaagt tatcaagttg atgcagagtt tcaaatctaa ggaatatgtg agagagacct 
ttgcttggat gcattactac tggttcctca caaatgaagg tattgacttt cttaggactt 
accttaatct cccatctgag attgttcctg ctactctgaa gaagcaacag aagcctcttg 

30 gtcgaccttt tggaggtggt ggtgaccgtc cccgtggccc tcctcgtggt gatggagaga 
ggaggtttgg tgacagagat ggataccgtg gaggtcctaa atcaggtgga gagtatggtg 
acaaggctgg agcacctgct gattaccagc ctggcttcag gggtggagct agtggagcaa 
ggcaagggtt tggtcgtgga gctggtggtt ttggtggtgg tgctggtcca getgetggat 
ctgatctacc ttgaaaagga ctttcttgtt tctttttggt cttatttaag gttacatagc 

35 accttattga gaacgaatgt gtcttttgga actttgtttc tttctcttaa accatttcac 
aatatgattg tttttttacg ttagaatttt atgaagaaat gttggatcgt tacatgaatc 
tttttacact gtcctccttt ttaaattctc ctatttagtt tttttcgaag ctt 
(2) INFORMATION FOR SEQ ID NO: 34 9: 
(i) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .214 

(D) OTHER INFORMATION: / Ceres Seq. ID 1975984 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
50 Met Ala Asp Lys Gly Arg Pro Leu Pro Lys Phe Gly Arg Ala Glu Met 
1 5 " 10 15 

Arg Cys Gly Gly Gly Val Ala Ser Phe Phe Asn Leu lie Ala lie Met 

20 25 30 

lie lie Ser Glu Thr Asn Arg Arg Glu lie Ser Lys Tyr Leu Phe Lys 
55 35 40 45 

Glu Gly Val Leu Phe Ala Lys Lys Asp Phe Asn Leu Pro Gin His Pro 

50 55 60 

Leu lie Glu Ser Val Pro Asn Leu Gin Val lie Lys Leu Met Gin Ser 
65 70 75 80 

60 Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp Met His Tyr 

85 90 95 

Tyr Trp Phe Leu Thr Asn Glu Gly lie Asp Phe Leu Arg Thr Tyr Leu 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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100 










105 










110 






Asn 


Leu 


Pro 


Ser 


Glu 


lie 


Val 


Pro 


Ala 


Thr 


Leu 


Lys 


Lys 


Gin 


Gin 


Lys 






115 










120 










125 








Pro 


Leu 


Gly 


Arg 


Pro 


Phe 


Gly 


Gly 


Gly 


Gly 


Asp 


Arg 


Pro 


Arg 


Gly 


Pro 




130 










135 










140 










Pro 


Arg 


Gly 


Asp 


Gly 


Glu 


Arg 


Arg 


Phe 


Gly 


Asp 


Arg 


Asp 


Gly 


Tyr 


Arg 


14 5 










150 










ICC 

1 J J 










160 


Gly 


Gly 


Pro 


Lys 


Ser 


Gly 


Gly 


Glu 


Tyr 


Gly 


Asp 


Lys 


Ala 


Gly 


Ala 


Pro 










165 










170 










175 




Ala 


Asp 


Tyr 


Gin 


Pro 


Gly 


Phe 


Arg 


Gly 


Gly 


Ala 


Ser 


Gly 


Ala 


Arg 


Gin 








180 










185 










190 






Gly 


Phe 


Gly 


Arg 


Gly 


Ala 


Gly 


Gly 


Phe 


Gly 


Gly 


Gly 


Ala 


Gly 


Pro 


Ala 






195 










200 










205 








Ala 


Gly 


Ser 


Asp 


Leu 


Pro 























15 210 

(2} INFORMATION FOR SEQ ID NO: 350: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 
2 0 (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
25 (B) LOCATION: 1..199 

(D) OTHER INFORMATION: / Ceres Seq. ID 1975985 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 





Met 


Arg 


Cys 


Gly 


Gly 


Gly 


Val 


Ala 


Ser 


Phe 


Phe 


Asn 


Leu 


He 


Ala 


He 




1 








5 










10 










15 




30 


Met 


He 


He 


Ser 
20 


Glu 


Thr 


Asn 


Arg 


Arg 

25 


Glu 


He 


Ser 


Lys 


Tyr 

30 


Leu 


Phe 




Lys 


Glu 


Gly 
35 


Val 


Leu 


Phe 


Ala 


Lys 
40 


Lys 


Asp 


Phe 


Asn 


Leu 
45 


Pro 


Gin 


His 




Pro 


Leu 


He 


Glu 


Ser 


Val 


Pro 


Asn 


Leu 


Gin 


Val 


He 


Lys 


Leu 


Met 


Gin 


35 




50 










55 










60 












Ser 


Phe 


Lys 


Ser 


Lys 


Glu 


Tyr 


Val 


Arg 


Glu 


Thr 


Phe 


Ala 


Trp 


Met 


His 




65 










70 










75 










80 




Tyr 


Tyr 


Trp 


Phe 


Leu 
85 


Thr 


Asn 


Glu 


Gly 


He 
90 


Asp 


Phe 


Leu 


Arg 


Thr 

95 


Tyr 


40 


Leu 


Asn 


Leu 


Pro 
100 


Ser 


Glu 


He 


Val 


Pro 
105 


Ala 


Thr 


Leu 


Lys 


Lys 
110 


Gin 


Gin 




Lys 


Pro 


Leu 
115 


Gly 


Arg 


Pro 


Phe 


Gly 
120 


Gly 


Gly 


Gly 


Asp 


Arg 
125 


Pro 


Arg 


Gly 




Pro 


Pro 


Arg 


Gly 


Asp 


Gly 


Glu 


Arg 


Arg 


Phe 


Gly 


Asp 


Arg 


Asp 


Gly 


Tyr 


45 




130 










135 










140 












Arg 


Gly 


Gly 


Pro 


Lys 


Ser 


Gly 


Gly 


Glu 


Tyr 


Gly 


Asp 


Lys 


Ala 


Gly 


Ala 




145 










150 










155 










160 




Pro 


Ala 


Asp 


Tyr 


Gin 
165 


Pro 


Gly 


Phe 


Arg 


Gly 
170 


Gly 


Ala 


Ser 


Gly 


Ala 
175 


Arg 


50 


Gin 
Ala 


Gly 
Ala 


Phe 

Gly 
195 


Gly 
180 
Ser 


Arg 
Asp 


Gly 
Leu 


Ala 
Pro 


Gly 


Gly 
185 


Phe 


Gly 


Gly 


Gly 


Ala 
190 


Gly 


Pro 




(2) 


INFORMATION 


FOR 


SEQ 


ID NO:351: 

















55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
60 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 
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(B) LOCATION: 1..183 

(D) OTHER INFORMATION: / Ceres Seq. ID 1975986 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
Met lie lie Ser Glu Thr Asn Arg Arg Glu lie Ser Lys Tyr Leu Phe 
5 1 5 10 15 

Lys Glu Gly Val Leu Phe Ala Lys Lys Asp Phe Asn Leu Pro Gin His 

20 25 30 

Pro Leu lie Glu Ser Val Pro Asn Leu Gin Val lie Lys Leu Met Gin 
35 40 45 

10 Ser Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp Met His 
50 : 55 60 

Tyr Tyr Trp Phe Leu Thr Asn Glu Gly lie Asp Phe Leu Arg Thr Tyr 
65 7 0 7 5 8 0 

Leu Asn Leu Pro Ser Glu lie Val Pro Ala Thr Leu Lys Lys Gin Gin 
15 85 90 95 

Lys Pro Leu Gly Arg Pro Phe Gly Gly Gly Gly Asp Arg Pro Arg Gly 

100 105 110 

Pro Pro Arg Gly Asp Gly Glu Arg Arg Phe Gly Asp Arg Asp Gly Tyr 
115 120 125 

2 0 Arg Gly Gly Pro Lys Ser Gly Gly Glu Tyr Gly Asp Lys Ala Gly Ala 
130 135 140 

Pro Ala Asp Tyr Gin Pro Gly Phe Arg Gly Gly Ala Ser Gly Ala Arg 
145 150 155 160 

Gin Gly Phe Gly Arg Gly Ala Gly Gly Phe Gly Gly Gly Ala Gly Pro 
25 * 165 170 175 

Ala Ala Gly Ser Asp Leu Pro 
180 

(2) INFORMATION FOR SEQ ID NO: 352: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (ix) FEATURE: 

{A) NAME/KEY: - 

(B) LOCATION: 1..1027 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976019 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

4 0 atcgaaatta gggtttcgcg ttaggagaag aagttaaagc aaaacacata caaacgcagt 60 
caccttctct gtcgcctcct tcttcaatct catcgcaatc atgatcatat ccgagactaa 120 
tcgccgtgag atctccaagt acctcttcaa agagggtgtt ttgtttgcca aaaaggattt 180 
caatttacca caacatcctt tgattgagag tgttccaaat ctgcaagtta tcaagttgat 240 
gcagagtttc aaatctaagg aatatgtgag agagaccttt gcttggatgc attactactg 300 

45 gttcctcaca aatgaaggta ttgactttct taggacttac cttaatctcc catctgagat 360 
tgttcctgct actctgaaga agcaacagaa gcctcttggt cgaccttttg gaggtggtgg 420 
tgaccgtccc cgtggccctc ctcgtggtga tggagagagg aggtttggtg acagagatgg 480 
ataccgtgga ggtcctaaat caggtggaga gtatggtgac aaggctggag cacctgctga 540 
ttaccagcct ggcttcaggg gtggagctag tggagcaagg caagggtttg gtcgtggagc 600 

50 tggtggtttt ggtggtggtg ctggtccagc tgctggatct gatctacctt gaaaaggaga 660 
caatacagct cagccaaagc caaggccttt atcaccttac acaatgtacg cggacatgaa 720 
gcctccaaca tcaccacttc catctccagt caccaatcat tagctatgtg agacgagaat 780 
tggcctccta aaccctggag cttcttctta attgcaggtt gtaaacaatg agtaagagag 840 
tgatggggca attcagtttt gcaggtatga atcagtgagc ttatttgtac aataatacaa 900 

55 tattcatcaa catgccttat atatgagttc tacttttttt ctcctgttca acgatcaatg 960 
cagtaaccat attgttaccc tatctcatac atatatgatg acccaaatta attatcatta 1020 
ggctt tc 

(2) INFORMATION FOR SEQ ID NO:353: 
(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: peptide 
5 (B) LOCATION: 1..183 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976020 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 
Met lie lie Ser Glu Thr Asn Arg Arg Glu lie Ser Lys Tyr Leu Phe 
15 10 15 

10 Lys Glu Gly Val Leu Phe Ala Lys Lys Asp Phe Asn Leu Pro Gin His 

20 25 30 

Pro Leu He Glu Ser Val Pro Asn Leu Gin Val He Lys Leu Met Gin 

35 40 45 

Ser Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp Met His 
15 50 ' 55 60 

Tyr Tyr Trp Phe Leu Thr Asn Glu Gly He Asp Phe Leu Arg Thr Tyr 
65 70 75 80 

Leu Asn Leu Pro Ser Glu He Val Pro Ala Thr Leu Lys Lys Gin Gin 
85 90 95 

20 Lys Pro Leu Gly Arg Pro Phe Gly Gly Gly Gly Asp Arg Pro Arg Gly 

100 105 110 

Pro Pro Arg Gly Asp Gly Glu Arg Arg Phe Gly Asp Arg Asp Gly Tyr 

115 120 125 

Arg Gly Gly Pro Lys Ser Gly Gly Glu Tyr Gly Asp Lys Ala Gly Ala 
25 130 135 140 

Pro Ala Asp Tyr Gin Pro Gly Phe Arg Gly Gly Ala Ser Gly Ala Arg 
145 150 155 160 

Gin Gly Phe Gly Arg Gly Ala Gly Gly Phe Gly Gly Gly Ala Gly Pro 
165 170 175 

30 Ala Ala Gly Ser Asp Leu Pro 

180 

(2) INFORMATION FOR SEQ ID NO: 35 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 0 (A) NAME/KEY: peptide . 

(B) LOCATION: 1 . - 137 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976021 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 
Met Gin Ser Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp 
45 1 5 10 15 

Met His Tyr Tyr Trp Phe Leu Thr Asn Glu Gly He Asp Phe Leu Arg 

2 0 2 5 30 

Thr Tyr Leu Asn Leu Pro Ser Glu He Val Pro Ala Thr Leu Lys Lys 
35 4 0 45 

50 Gin Gin Lys Pro Leu Gly Arg Pro Phe Gly Gly Gly Gly Asp Arg Pro 
50 55 60 

Arg Gly Pro Pro Arg Gly Asp Gly Glu Arg Arg Phe Gly Asp Arg Asp 
65 70 75 80 

Gly Tyr Arg Gly Gly Pro Lys Ser Gly Gly Glu Tyr Gly Asp Lys Ala 

5 5 85 90 95 

Gly Ala Pro Ala Asp Tyr Gin Pro Gly Phe Arg Gly Gly Ala Ser Gly 

100 105 110 

Ala Arg Gin Gly Phe Gly Arg Gly Ala Gly Gly Phe Gly Gly Gly Ala 
115 120 125 

60 Gly Pro Ala Ala Gly Ser Asp Leu Pro 
130 135 
(2) INFORMATION FOR SEQ ID NO: 355: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..121 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976022 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 



Met 


His 


Tyr 


Tyr 


Trp 


Phe 


Leu 


Thr 


Asn 


Glu 


Gly 


He 


Asp Phe 


Leu 


Arg 


1 








5 










10 








15 




Thr 


Tyr 


Leu 


Asn 


Leu 


Pro 


Ser 


Glu 


lie 


Val 


Pro 


Ala 


Thr Leu 


Lys 


Lys 








20 










25 








30 






Gin 


Gin 


Lys 


Pro 


Leu 


Gly 


Arg 


Pro 


Phe 


Gly 


Gly 


Gly 


Gly Asp 


Arg 


Pro 






35 










40 










45 






Arg 


Gly 


Pro 


Pro 


Arg 


Gly 


Asp 


Gly 


Glu 


Arg 


Arg 


Phe 


Gly Asp 


Arg 


Asp 




50 










55 










60 








Glv 


Tyr 


Arg 


Gly 


Gly 


Pro 


Lys 


Ser 


Gly 


Gly 


Glu 


Tyr 


Gly Asp 


Lys 


Ala 


65 










70 










75 








80 


Gly 


Ala 


Pro 


Ala 


Asp 


Tyr 


Gin 


Pro 


Gly 


Phe 


Arg 


Gly 


Gly Ala 


Ser 


Gly 










85 










90 








95 




Ala 


Arg 


Gin 


Gly 


Phe 


Gly 


Arg 


Gly 


Ala 


Gly 


Gly 


Phe 


Gly Gly 


Gly 


Ala 








100 










105 








110 






Gly 


Pro 


Ala 


Ala 


Gly 


Ser 


Asp 


Leu 


Pro 


















115 










120 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:356: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 478 

(D) OTHER INFORMATION : / Ceres Seq. ID 1976673 
(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 356: 



aaattcaatc tcttccaatt tctcttcttc ttcctcgcga cgcgttaatg gcggcttcgt 60 

tctcactcac gagcttcatc tccttcatct caccattcaa atctcaaacc aaacctacac 120 

caccaccaaa tctcactctt ccttctccaa ctatctccca aaggcgaaga aatgatctcg 180 

ctatcgaatc aatggcggtc gaagaatctt cttcaaccgc ttcttcactt tcctctgagc 240 

ttgcttctgt gatatgcccc tcgcttgctt actccaacac gctcttcttc agttctggat 300 

acaatgtgca agtgtttgtt gaagataacg agtcagagga gaggcttgtg aatcgattta 360 

ggagagaagt gatgagaact ggtgttatac aggaatgtaa gaggagaaga tactttgaga 420 



ataaacaaga tgagaagaaa cgtaggactc gtgatgctgc taagcgtaat aagaaaag 
(2) INFORMATION FOR SEQ ID NO: 357: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..158 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976674 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 
He Gin Ser Leu Pro lie Ser Leu Leu Leu Pro Arg Asp Ala Leu Met 
15 10 15 

Ala Ala Ser Phe Ser Leu Thr Ser Phe lie Ser Phe He Ser Pro Phe 
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20 25 30 

Lys Ser Gin Thr Lys Pro Thr Pro Pro Pro Asn Leu Thr Leu Pro Ser 

35 40 45 

Pro Thr lie Ser Gin Arg Arg Arg Asn Asp Leu Ala lie Glu Ser Met 

50 55 60 

Ala Val Glu Glu Ser Ser Ser Thr Ala Ser Ser Leu Ser Ser Glu Leu 
65 70 75 80 

Ala Ser Val lie Cys Pro Ser Leu Ala Tyr Ser Asn Thr Leu Phe Phe 

85 90 95 

Ser Ser Gly Tyr Asn Val Gin Val Phe Val Glu A.sp Asn Glu Ser Glu 

100 105 110 

Glu Arg Leu Val Asn Arg Phe Arg Arg Glu Val Met Arg Thr Gly Val 

115 120 125 

lie Gin Glu Cys Lys Arg Arg Arg Tyr Phe Glu Asn Lys Gin Asp Glu 

130 135 140 

Lys Lys Arg Arg Thr Arg Asp Ala Ala Lys Arg Asn Lys Lys 
145 ' ' 150 155 

(2) INFORMATION FOR SEQ ID NO: 358: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) ST HANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 143 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976675 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



Met 


Ala 


Ala 


Ser 


Phe 


Ser 


Leu 


Thr 


Ser 


Phe 


He 


Ser 


Phe 


He 


Ser 


Pro 


1 








5 










10 










15 




Phe 


Lys 


Ser 


Gin 


Thr 


Lys 


Pro 


Thr 


Pro 


Pro 


Pro 


Asn 


Leu 


Thr 


Leu 


Pro 








20 










25 










30 






Ser 


Pro 


Thr 


He 


Ser 


Gin 


Arg 


Arg 


Arg 


Asn 


Asp 


Leu 


Ala 


He 


Glu 


Ser 






35 










40 










45 








Met 


Ala 


Val 


Glu 


Glu 


Ser 


Ser 


Ser 


Thr 


Ala 


Ser 


Ser 


Leu 


Ser 


Ser 


Glu 




50 










55 










60 










Leu 


Ala 


Ser 


Val 


He 


Cys 


Pro 


Ser 


Leu 


Ala 


Tyr 


Ser 


Asn 


Thr 


Leu 


Phe 


65 










70 










75 










80 


Phe 


Ser 


Ser 


Gly 


Tyr 


Asn 


Val 


Gin 


Val 


Phe 


Val 


Glu 


Asp 


Asn 


Glu 


Ser 










85 










90 










95 




Glu 


Glu 


Arg 


Leu 


Val 


Asn 


Arg 


Phe 


Arg 


Arg 


Glu 


Val 


Met 


Arg 


Thr 


Gly 








100 










105 










110 






Val 


He 


Gin 


Glu 


Cys 


Lys 


Arg 


Arg 


Arg 


Tyr 


Phe 


Glu 


Asn 


Lys 


Gin 


Asp 






115 










120 










125 








Glu 


Lys 


Lys 


Arg 


Arg 


Thr 


Arg 


Asp 


Ala 


Ala 


Lys 


Arg 


Asn 


Lys 


Lys 





130 135 140 

(2) INFORMATION FOR SEQ ID NO: 359: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : peptide 

(B) LOCATION: 1 . . 95 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976676 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
Met Ala Val Glu Glu Ser Ser Ser Thr Ala Ser Ser Leu Ser Ser Glu 
15 10 15 

Leu Ala Ser Val He Cys Pro Ser Leu Ala Tyr Ser Asn Thr Leu Phe 
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^ U 










9 R 










J U 






Fne 


Ser Ser Gly 


Tyr 


Asn 






vax 


pne 


val 


(j_LU 


Asp 


Asn 


Glu 


Ser 




JJ 








40 










A ^ 








Glu 


Glu Arg Leu 


Val 


Asn 


Arg 


Phe 


Arg 


Arg 


Glu 


Val 


Met 


Arg 


Thr 


Gly 




50 






55 










60 










Val 


lie Gin Glu 


Cys 


Lys 


Arg 


Arg 


Arg 


Tyr 


Phe 


Glu 


Asn 


Lys 


Gin 


Asp 


65 






70 










75 










80 


Glu 


Lys Lys Arg 


Arg 


Thr 


Arg 


Asp 


Ala 


Ala 


Lys 


Arg 


Asn 


Lys 


Lys 








85 










90 










95 




(2) 


INFORMATION 


FOR 


SEQ 


ID NO:360: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 107 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025186 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



aatcatttgc agaagaaaaa taaaaaaaaa aaacagagaa aaggagata~ ggaatcaatc 60 

ggagtcctta tgatgtgccc catgtcctcc tacctcgaga acgagcttga gaagcgcttc 120 

aaccttcttc gcttctggac ttctccggag aaatccgtcc tcctagaaac tcatcggaac 180 

tccatccgcg ccgtcgttgg gaatgcttct gccggcgccg atgctcagct catcagcgat 240 

ctgcccaacc ttgagattgt atccagcttc agcgtcgggc tcgacaagat cgatttgggg 300 

aaatgcaaag aaaaagggat ccgcgtcacc aacacccccg acgttctcac cgaagacgtc 360 

gcagatctcg ccatcgggct tatcctggct ctccttcgac ggctgtgtga gtgcgatcgc 420 

tatgtaagga gcggaaaatg gaagcaaggt gaattccaac tcactaccaa gtttagtgga 480 

aaatccgtgg ggatcattgg tctaggtaga attgggactg ccatcgcaaa gagggctgaa 540 

gcctttagct gcccaatcaa ttactactca agaaccatta agcctgatgt cgcctacaag 600 

tattatccga cggtggttga ccttgctcaa aactcagaca tcctcgtcgt cgcatgcccg 660 

ttgaccgagc agaccagaca cattgtggac cggcaggtca tggatgcatt aggagctaag 720 

ggcgtcctca taaacattgg ccgtggacca catgttgatg agcaagagct tattaaagct 780 

ctaacagaag gccgcctagg tggggctgcc cttgatgtgt ttgagcagga gcctcacgtg 840 

cccgaggagc tctttggcct tgagaatgta gttctcctcc ctcacgttgg gagtggcact 900 

gtggaaacac ggaatgccat ggccgatctt gtcgtgggta acttggaagc gcacttttct 960 

gggaaatcac ttctgactcc ggtcgtctga gtcatgtgac acatttggga tttggtttca 1020 



cttcatttga aaggttttta tgttaacgaa ctcatgaaaa gggtaatttc tcttcc 
(2) INFORMATION FOR SEQ ID NO: 361: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .32 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025187 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 



Asn 


His 


Leu 


Gin 


Lys 


Lys 


Asn 


Lys 


Lys 


Lys 


Lys 


Gin 


Arg 


liVS 


Gly 


Asp 


1 








5 










10 










15 




Met 


Glu 


Ser 


He 


Gly 


Val 


Leu 


Met 


Met 


Cys 


Pro 


Met 


Ser 


Ser 


Tyr 


Leu 








20 










25 










30 






Glu 


Asn 


Glu 


Leu 


Glu 


Lys 


Arg 


Phe 


Asn 


Leu 


Leu 


Arg 


Phe 


Trp 


Thr 


Ser 






35 










40 










45 








Pro 


Glu 


Lys 


Ser 


Val 


Leu 


Leu 


Glu 


Thr 


His 


Arg 


Asn 


Ser 


He 


Arg 


Ala 




50 










55 










60 










Val 


Val 


Gly 


Asn 


Ala 


Ser 


Ala 


Gly 


Ala 


Asp 


Ala 


Gin 


Leu 


He 


Ser 


Asp 


65 










70 










75 










80 


Leu 


Pro 


Asn 


Leu 


Glu 


He 


Val 


Ser 


Ser 


Phe 


Ser 


Val 


Gly 


Leu 


Asp 


Lys 
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85 90 95 

lie Asp Leu Gly Lys Cys Lys Glu Lys Gly lie Arg Val Thr Asn Thr 

100 105 110 

Pro Asp Val Leu Thr Glu Asp Val Ala Asp Leu Ala lie Gly Leu lie 
5 115 120 125 

Leu Ala Leu Leu Arg Arg Leu Cys Glu Cys Asp Arg Tyr Val Arg Ser 

130 135 ^ 14 0 

Gly Lys Trp Lys Gin Gly Glu Phe Gin Leu Thr Thr Lys Phe Ser Gly 
145 150 155 160 

10 Lys Ser Val Gly lie lie Gly Leu Gly Arg lie Gly Thr Ala lie Ala 

165 170 175 

Lys Arg Ala Glu Ala Phe Ser Cys Pro lie Asn Tyr Tyr Ser Arg Thr 

180 185 190 

lie Lys Pro Asp Val Ala Tyr Lys Tyr Tyr Pro Thr Val Val Asp Leu 
15 ~ 195 ^ 200 ^ 2 05 

Ala Gin Asn Ser Asp lie Leu Val Val Ala Cys Pro Leu Thr Glu Gin 

210 215 220 

Thr Arg His lie Val Asp Arg Gin Val Met Asp Ala Leu Gly Ala Lys 
225 230 235 240 

2 0 Gly Val Leu He Asn He Gly Arg Gly Pro His Val Asp Glu Gin Glu 

245 250 255 

Leu He Lys Ala Leu Thr Glu Gly Arg Leu Gly Gly Ala Ala Leu Asp 

260 265 270 

Val Phe Glu Gin Glu Pro His Val Pro Glu Glu Leu Phe Gly Leu Glu 
25 275 280 285 

Asn Val Val Leu Leu Pro His Val Gly Ser Gly Thr Val Glu Thr Arg 

290 295 300 

Asn Ala Met Ala Asp Leu Val Val Gly Asn Leu Glu Ala His Phe Ser 
305 310 315 320 

30 Gly Lys Ser Leu Leu Thr Pro Val Val 

325 

(2) INFORMATION FOR SEQ ID NO: 362: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 0 (A) NAME /KEY : peptide 

(B) LOCATION: 1.-313 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025188 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:362: 
Met Glu Ser lie Gly Val Leu Met Met Cys Pro Met Ser Ser Tyr Leu 
45 1 5 10 15 

Glu Asn Glu Leu Glu Lys Arg Phe Asn Leu Leu Arg Phe Trp Thr Ser 

20 25 30 

Pro Glu Lys Ser Val Leu Leu Glu Thr His Arg Asn Ser lie Arg Ala 
35 40 45 

50 Val Val Gly Asn Ala Ser Ala Gly Ala Asp Ala Gin Leu lie Ser Asp 
50 55 60 

Leu Pro Asn Leu Glu lie Val Ser Ser Phe Ser Val Gly Leu Asp Lys 
65 70 75 80 

lie Asp Leu Gly Lys Cys Lys Glu Lys Gly lie Arg Val Thr Asn Thr 
55 85 90 95 

Pro Asp Val Leu Thr Glu Asp Val Ala Asp Leu Ala lie Gly Leu lie 

100 105 110 

Leu Ala Leu Leu Arg Arg Leu Cys Glu Cys Asp Arg Tyr Val Arg Ser 
115 120 125 

60 Gly Lys Trp Lys Gin Gly Glu Phe Gin Leu Thr Thr Lys Phe Ser Gly 
130 " 135 140 

Lys Ser Val Gly lie lie Gly Leu Gly Arg lie Gly Thr Ala lie Ala 
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145 










150 










155 










160 


Lys 


Arg 


Ala 


Glu 


Ala 


Phe 


Ser 


Cys 


Pro 


lie 


Asn 


Tyr 


Tyr 


Ser 


Arg 


Thr 










165 










170 










175 




lie 


Lys 


Pro 


Asp 


Val 


Ala 


Tyr 


Lys 


Tyr 


Tyr 


Pro 


Thr 


Val 


Val 


Asp 


Leu 








180 










185 










190 






Ala 


Gin 


Asn 


Ser 


Asp 


lie 


Leu 


Val 


Val 


Ala 


Cys 


Pro 


Leu 


Thr 


Glu 


Gin 






195 










200 










205 








Thr 


Arg 


His 


lie 


Val 


Asp 


Arg 


Gin 


Val 


Met 


Asp 


Ala 


Leu 


Gly 


Ala 


Lys 




210 










215 










220 










Gly 


Val 


Leu 


lie 


Asn 


lie 


Gly 


Arg 


Gly 


Pro 


His 


Val 


Asp 


Glu 


Gin 


Glu 


225 










230 










235 










240 


Leu 


lie 


Lys 


Ala 


Leu 


Thr 


Glu 


Gly 


Arg 


Leu 


Gly 


Gly 


Ala 


Ala 


Leu 


Asp 










24 5 










250 










255 




Val 


Phe 


Glu 


Gin 


Glu 


Pro 


His 


Val 


Pro 


Glu 


Glu 


Leu 


Phe 


Gly 


Leu 


Glu 








260 










265 










270 






Asn 


Val 


Val 


Leu 


Leu 


Pro 


His 


Val 


Gly 


Ser 


Gly 


Thr 


Val 


Glu 


Thr 


Arg 






275 










280 










285 








Asn 


Ala 


Met 


Ala 


Asp 


Leu 


Val 


Val 


Gly 


Asn 


Leu 


Glu 


Ala 


His 


Phe 


Ser 




290 










295 










300 










Gly 


Lys 


Ser 


Leu 


Leu 


Thr 


Pro 


Val 


Val 

















305 310 
(2) INFORMATION FOR SEQ ID NO: 3 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

30 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . .30 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025189 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 





Met 


Met 


Cys 


Pro 


Met 


Ser 


Ser 


Tyr 


Leu 


Glu 


Asn 


Glu 


Leu 


Glu 


Lys 


Arg 


35 


1 








5 










10 










15 






Phe 


Asn 


Leu 


Leu 


Arg 


Phe 


Trp 


Thr 


Ser 


Pro 


Glu 


Lys 


Ser 


Val 


Leu 


Leu 










20 










25 










30 








Glu 


Thr 


His 


Arg 


Asn 


Ser 


He 


Arg 


Ala 


Val 


Val 


Gly 


Asn 


Ala 


Ser 


Ala 








35 










40 










45 








40 


Gly 


Ala 


Asp 


Ala 


Gin 


Leu 


He 


Ser 


Asp 


Leu 


Pro 


Asn 


Leu 


Glu 


He 


Val 






50 










55 










60 












Ser 


Ser 


Phe 


Ser 


Val 


Gly 


Leu 


Asp 


Lys 


He 


Asp 


Leu 


Gly 


Lys 


Cys 


Lys 




65 










70 










75 










80 




Glu 


Lys 


Gly 


He 


Arg 


Val 


Thr 


Asn 


Thr 


Pro 


Asp 


Val 


Leu 


Thr 


Glu 


Asp 


45 










85 










90 










95 






Val 


Ala 


Asp 


Leu 


Ala 


He 


Gly 


Leu 


He 


Leu 


Ala 


Leu 


Leu 


Arg 


Arg 


Leu 










100 










105 










110 








Cys 


Glu 


Cys 


Asp 


Arg 


Tyr 


Val 


Arg 


Ser 


Gly 


Lys 


Trp 


Lys 


Gin 


Gly 


Glu 








115 










120 










125 








50 


Phe 


Gin 


Leu 


Thr 


Thr 


Lys 


Phe 


Ser 


Gly 


Lys 


Ser 


Val 


Gly 


He 


He 


Gly 






130 










135 










140 












Leu 


Gly 


Arg 


He 


Gly 


Thr 


Ala 


He 


Ala 


Lys 


Arg 


Ala 


Glu 


Ala 


Phe 


Ser 




145 










150 










155 










160 




Cys 


Pro 


lie 


Asn 


Tyr 


Tyr 


Ser 


Arg 


Thr 


He 


Lys 


Pro 


Asp 


Val 


Ala 


Tyr 


55 










165 










170 










175 






Lys 


Tyr 


Tyr 


Pro 


Thr 


Val 


Val 


Asp 


Leu 


Ala 


Gin 


Asn 


Ser 


Asp 


He 


Leu 










180 










185 










190 








Val 


Val 


Ala 


Cys 


Pro 


Leu 


Thr 


Glu 


Gin 


Thr 


Arg 


His 


He 


Val 


Asp 


Arg 








195 










200 










205 








60 


Gin 


Val 


Met 


Asp 


Ala 


Leu 


Gly 


Ala 


Lys 


Gly 


Val 


Leu 


He 


Asn 


He 


Gly 






210 










215 










220 












Arg 


Gly 


Pro 


His 


Val 


Asp 


Glu 


Gin 


Glu 


Leu 


He 


Lys 


Ala 


Leu 


Thr 


Glu 
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225 230 235 240 

Gly Arg Leu Gly Gly Ala Ala Leu Asp Val Phe Glu Gin Glu Pro His 

245 250 255 

Val Pro Glu Glu Leu Phe Gly Leu Glu Asn Val Val Leu Leu Pro His 
5 260 265 270 

Val Gly Ser Gly Thr Val Glu Thr Arg Asn Ala Met Ala Asp Leu Val 

275 280 285 

Val Gly Asn Leu Glu Ala His Phe Ser Gly Lys Ser Leu Leu Thr Pro 
290 295 " 300 

10 Val Val 
305 

(2) INFORMATION FOR SEQ ID NO: 364: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
20 (A) NAME /KEY : - 

(B) LOCATION: 1 . . 555 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025372 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:364: 
aaaactcact ttcacttgca caaagataag gaaaccatgt ctgtgtcagc gatctttggt 60 

25 accggaatcg tcaccgttgc tgcttctccg gttctccgcc aatttcaagt tccaaaattg 120 
ggtaatggag gaggattagg gatggtgata gagtgttcgt cgaggccaca gaagaaatcg 180 
acggctcatc acaggaagac gaggccgaag aagactcagc cttgggacat taagagaaag 240 
cctactgttt acgctcctct tcctcctctt ccggcggaat ggagtccgtt cactctcgct 300 
tctgacgacg gtggtgctgc cactgctgcg ggagatttgg tttcaggcgc tgcctagttg 360 

30 gtatgagtta tctgctgatt tggttgtaat ctagtttgag aacttttgtt gttgtttact 420 
gtgttttcag ttttctgatg tttgttgttg ggtttgtttg tgtttgattg gtgaatgaat 480 
ggaccaaatt tgcgatttat aaagcttcaa cctttctcct ctgtttttga gtttaaaggt 540 
ctcaacttta tagtt 

(2) INFORMATION FOR SEQ ID NO: 365: 
35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1..118 

(D) OTHER INFORMATION: ./ Ceres Seq. ID 2025373 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

Lys Thr His Phe His Leu His Lys Asp Lys Glu Thr Met Ser Val Ser 
15 10 15 

Ala lie Phe Gly Thr Gly lie Val Thr Val Ala Ala Ser Pro Val Leu 
20 25 30 

50 Arg Gin Phe Gin Val Pro Lys Leu Gly Asn Gly Gly Gly Leu Gly Met 
35 40 45 

Val lie Glu Cys Ser Ser Arg Pro Gin Lys Lys Ser Thr Ala His His 

50 " 55 60 

Arg Lys Thr Arg Pro Lys Lys Thr Gin Pro Trp Asp lie Lys Arg Lys 
55 65 70 75 80 

Pro Thr Val Tyr Ala Pro Leu Pro Pro Leu Pro Ala Glu Trp Ser Pro 

85 90 95 

Phe Thr Leu Ala Ser Asp Asp Gly Gly Ala Ala Thr Ala Ala Gly Asp 
100 105 110 

60 Leu Val Ser Gly Ala Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 366: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..106 

10 (D) OTHER INFORMATION: / Ceres Seq. ID 2025374 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:366: 
Met Ser Val Ser Ala lie Phe Gly Thr Gly lie Val Thr Val Ala Ala 
15 10 15 

Ser Pro Val Leu Arg Gin Phe Gin Val Pro Lys Leu Gly Asn Gly Gly 
15 20 2 5 30 

Gly Leu Gly Met Val lie Glu Cys Ser Ser Arg Pro Gin Lys Lys Ser 

35 4 0 45 

Thr Ala His His Arg Lys Thr Arg Pro Lys Lys Thr Gin Pro Trp Asp 
50 55 60 

20 He Lys Arg Lys Pro Thr Val Tyr Ala Pro Leu Pro Pro Leu Pro Ala 
65 70 75 80 

Glu Trp Ser Pro Phe Thr Leu Ala Ser Asp Asp Gly Gly Ala Ala Thr 

85 90^ * 95 

Ala Ala Gly Aso Leu Val Ser Gly Ala Ala 
25 1 100 105 

(2) INFORMATION FOR SEQ ID NO: 3 67: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
30 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
35 (B) LOCATION: 1..71 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025375 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 
Met Val He Glu Cys Ser Ser Arg Pro Gin Lys Lys Ser Thr Ala His 
15 10 15 

4 0 His Arg Lys Thr Arg Pro Lys Lys Thr Gin Pro Trp Asp lie Lys Arg 

20 25 30 

Lys Pro Thr Val Tyr Ala Pro Leu Pro Pro Leu Pro Ala Glu Trp Ser 

^35 4 0 4 5 

Pro Phe Thr Leu Ala Ser Asp Asp Gly Gly Ala Ala Thr Ala Ala Gly 
45 50 55 60 

Asp Leu Val Ser Gly Ala Ala 
65 7 0 

(2) INFORMATION FOR SEQ ID NO: 368: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
55 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 631 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025471 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 
60 atgtcgcgaa agttcctaaa accgccacgt gtcagtcact tatccaacag taatccaatt 60 
attcgtcaag cctccggttt attaccgtct aaaccgaaaa gaaaaaactc tcttcggaat 120 
ctgaaaagcc ataaatcagc agaaaaaaat cttaaggatc ggaaatttta taatcggatg 180 
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aatagagtga ttagtcagtt ttccggtaag ttgatgaagg agaagagtgt taccggtgta 240 

agcggcggag atacgacttt aagatctgcc gtcaaagaat cggtctcgtc tccacagagt 300 

gcgtcgtcgt cgtcgtcggt gaggaggctt aaaggagatc tggaatcgag tcgatttggt 360 

gcggcggcga gtgagaggct gaggcaagcg gaggaatctt tgaggacggt gatgtttctg 420 

agctgttggg gatcttgtta gatctggttg agaaaataat agatgagaaa acgaaaccaa 480 

aacgtttctg tttgtgtttt tgagttttgg atttttgttt tctctgtgaa tagttttttt 540 

tttttttttc gttttctttg tactttgtgt tttatgtaaa tcatgtataa aatgaaaagt 600 



aaatcaatga agataatggt ttgtcttcac c 
(2) INFORMATION FOR SEQ ID NO: 369: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 14 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025472 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:369: 



Met 


Ser 


Arg 


Lys 


Phe 


Leu 


Lys 


Pro 


Pro 


Arg 


Val 


Ser 


His 


Leu 


Ser 


Asn 


i 








5 










10 










15 




Ser 


Asn 


Pro 


lie 


lie 


Arg 


Gin 


Ala 


Ser 


Gly 


Leu 


Leu 


Pro 


Ser 


Lys 


Pro 








20 










25 










30 






Lys 


Arg 


Lys 


Asn 


Ser 


Leu 


Arg 


Asn 


Leu 


Lys 


Ser 


His 


Lys 


Ser 


Ala 


Glu 






35 










40 










45 








Lys 


Asn 


Leu 


Lys 


Asp 


Arg 


Lys 


Phe 


Tyr 


Asn 


Arg 


Met 


Asn 


Arg 


Val 


lie 




50 










55 










60 










Ser 


Gin 


Phe 


Ser 


Gly 


Lys 


Leu 


Met 


Lys 


Glu 


Lys 


Ser 


Val 


Thr 


Gly 


Val 


65 










70 










75 










80 


Ser 


Gly 


Gly 


Asp 


Thr 


Thr 


Leu 


Arg 


Ser 


Ala 


Val 


Lys 


Glu 


Ser 


Val 


Ser 










85 










90 










95 




Ser 


Pro 


Gin 


Ser 


Ala 


Ser 


Ser 


Ser 


Ser 


Ser 


Val 


Arg 


Arg 


Leu 


Lys 


Gly 








100 










105 










110 






Asp 


Leu 


Glu 


Ser 


Ser 


Arg 


Phe 


Gly 


Ala 


Ala 


Ala 


Ser 


Glu 


Arg 


Leu 


Arg 






115 










120 










125 








Gin 


Ala 


Glu 


Glu 


Ser 


Leu 


Arg 


Thr 


Val 


Met 


Phe 


Leu 


Ser 


Cys 


Trp 


Gly 



130 135 140 

Ser Cys 
145 

(2) INFORMATION FOR SEQ ID NO: 370': 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. . 87 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025473 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



Met Asn 


Arg 


Val 


He 


Ser 


Gin 


Phe 


Ser 


Gly 


Lys 


Leu 


Met Lys 


Glu 


Lys 


1 






5 










10 








15 




Ser Val 


Thr 


Gly 


Val 


Ser 


Gly 


Gly 


Asp 


Thr 


Thr 


Leu 


Arg Ser 


Ala 


Val 






20 










25 








30 






Lys Glu 


Ser 


Val 


Ser 


Ser 


Pro 


Gin 


Ser 


Ala 


Ser 


Ser 


Ser Ser 


Ser 


Val 




35 










40 










45 






Arg Arg 


Leu 


Lys 


Gly 


Asp 


Leu 


Glu 


Ser 


Ser 


Arg 


Phe 


Gly Ala 


Ala 


Ala 


50 










55 










60 








Ser Glu 


Arg 


Leu 


Arg 


Gin 


Ala 


Glu 


Glu 


Ser 


Leu 


Arg 


Thr Val 


Met 


Phe 


65 








70 










75 








80 
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Leu Ser Cys Trp Gly Ser Cys 
8 5 

(2) INFORMATION FOR SEQ ID NO: 371: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 7 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025474 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 
15 Met Lys Glu Lys Ser Val Thr Gly Val Ser Gly Gly Asp Thr Thr Leu 
15 10 15 

Arg Ser Ala Val Lys Glu Ser Val Ser Ser Pro Gin Ser Ala Ser Ser 

20 25 30 

Ser Ser Ser Val Arg Arg Leu Lys Gly Asp Leu Glu Ser Ser Arg Phe 
20 35 40 45 

Gly Ala Ala Ala Ser Glu Arg Leu Arg Gin Ala Glu Glu Ser Leu Arg 

50 55 60 

Thr Val Met Phe Leu Ser Cys Trp Gly Ser Cys 
65 70 75 

25 (2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..2029 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 2025475 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 
atgggtaaga gttggttttc agctgtgaag aaagcattaa gcccagaacc aaaacaaaag 60 
aaagagcaga agccacataa gtccaagaaa tggtttggta aatccaagaa gctagatgtt 120 
actaattctg gtgcagcata ttctcctcgt actgtcaaag acgcgaaact aaaggagatt 180 

40 gaggagcaac agagcagaca tgcttactct gtggctattg caactgctgc agctgcagag 240 
gcagccgttg cagctgctca agctgctgct gaagttgttc gtctctctgc attatcacgg 300 
ttccccggga aatcaatgga agagatcgcc gctatcaaga ttcagacagc atttagagga 360 
tatatggcaa gaagagcatt gcgtgcgttg agaggtcttg tgaggctaaa atctttagtc 420 
caggggaaat gtgtgagacg tcaagccaca tctacattgc aaagcatgca aacactagct 480 

4 5 agagtacaat atcagattcg tgagagaagg ctccgattgt ctgaggataa acaggcttta 540 
acacgacagc tccaacaaaa acacaataaa gactttgata agactggaga aaattggaat 600 
gatagtacat tgtcgcggga gaaagttgaa gcaaacatgt tgaacaagca agtagcaaca 660 
atgagaagag aaaaagcgct tgcatatgca ttcagtcacc agaatacatg gaaaaactca 720 
actaaaatgg gttctcaaac attcatggac cctaacaatc cgcattgggg ttggagttgg 780 

50 ctagaacgtt ggatggctgc tcgaccaaac gaaaaccact cactcacacc agataatgct 840 
gaaaaagact cttctgctag gagtgtagca agccgtgcca tgtctgagat gattccacga 900 
ggcaaaaacc tttcaccaag aggaaagaca ccaaacagtc gaagagggtc aagcccgaga 960 
gtgaggcaag tcccaagtga agactcaaac agcattgtga gtttccaatc agaacaacct 1020 
tgcaatcgta ggcatagcac ttgtggatca attccatcaa ctagagacga tgaaagcttc 1080 

55 accagtagtt tctctcagtc agttccaggc tacatggcac ctacacaagc cgccaaagca 1140 
agagctcgat tctcaaacct tagtcctcta agctcagaga agacagcgaa aaaacggctt 1200 
tccttctcgg gatctcctaa gactgtaaga cggttttcag tagggattct agggctttcc 1260 
ttcattggtc agatcagacg acgttttaca tcttcttctt cttcctcttc gatattcgtc 1320 
agtgtgtgta ttttggggaa aactttgtga gcaaagagcg agaaaatgag cggaccggta 1380 

60 agaaaatcgc ggatgtggct ttcaaagctt cgaggactat cgattgggat ggtatggcta 1440 
aggtccttgt cacagatgag gctcgtagag agttctctaa ccttcgtcgt gctttcgatg 1500 
aggttaacac acagctccag accaaattta gtcaggaacc tgaacccata gattgggatt 1560 
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actataggaa gggtattgga gctggcattg ttgacaagta caaggaagct tatgacagca 
ttgagattcc aaagtacgtt gacaaagtta ctcctgaata caagccaaag tttgatgctt 
tgttggtgga actgaaagaa gcagaacaga aatcgctcaa ggagtctgaa cggttggaga 
aagaaattgc tgatgtccaa gagatcagca aaaagctcag caccatgact gcagatgagt 
actttgagaa gcacccggaa ctcaaaaaga agtttgatga cgaaatccgt aatgacaact 
ggggatactg atcatgtttc tccatctccg gcttggaaag aaaactctct ttctctttct 
ctgttctctt actgtgattt tgtgagccaa tcataacaat aataagtaca ccattcactt 
aagcagtgtt gagatcttca ttccaaggaa gataaacgca tttggtttt 
(2) INFORMATION FOR SEQ ID NO: 373: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE 
FEATURE : 

(A) NAME/KEY 

(B) LOCATION 



peptide 
1 . . 449 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025476 
SEQUENCE DESCRIPTION: SEQ ID NO:373: 





Met 


Gly 


Lys 


Ser 


Trp 


Phe 


Ser 


Ala 


Val 


Lys 


Lys 


Ala 


Leu 


Ser 


Pro 


Glu 




1 








5 










10 










15 






Pro 


Lys 


Gin 


Lys 


Lys 


Glu 


Gin 


Lys 


Pro 


His 


Lys 


Ser 


Lys 


Lys 


Trp 


Phe 










20 










25 










30 






25 


Gly 


Lys 


Ser 


Lys 


Lys 


Leu 


Asp 


Val 


Thr 


Asn 


Ser 


Gly 


Ala 


Ala 


Tyr 


Ser 








35 










40 










45 










Pro 


Arg 


Thr 


Val 


Lys 


Asp 


Ala 


Lys 


Leu 


Lys 


Glu 


He 


Glu 


Glu 


Gin 


Gin 






50 










55 










60 












Ser 


Arg 


His 


Ala 


Tyr 


Ser 


Val 


Ala 


He 


Ala 


Thr 


Ala 


Ala 


Ala 


Ala 


Glu 


30 


65 










70 










75 










80 




Ala 


Ala 


Val 


Ala 


Ala 


Ala 


Gin 


Ala 


Ala 


Ala 


Glu 


Val 


Val 


Arg 


Leu 


Ser 












85 










90 










95 






Ala 


Leu 


Ser 


Arg 


Phe 


Pro 


Gly 


Lys 


Ser 


Met 


Glu 


Glu 


He 


Ala 


Ala 


He 










100 










105 










110 






35 


Lys 


He 


Gin 


Thr 


Ala 


Phe 


Arg 


Gly 


Tyr 


Met 


Ala 


Arg 


Arg 


Ala 


Leu 


Arg 








115 










120 










125 










Ala 


Leu 


Arg 


Gly 


Leu 


Val 


Arg 


Leu 


Lys 


Ser 


Leu 


Val 


Gin 


Gly 


Lys 


Cys 






130 










135 










140 












Val 


Arg 


Arg 


Gin 


Ala 


Thr 


Ser 


Thr 


Leu 


Gin 


Ser 


Met 


Gin 


Thr 


Leu 


Ala 


40 


145 










150 










155 










160 




Arg 


Val 


Gin 


Tyr 


Gin 


He 


Arg 


Glu 


Arg 


Arg 


Leu 


Arg 


Leu 


Ser 


Glu 


Asp 












165 










170 










175 






Lys 


Gin 


Ala 


Leu 


Thr 


Arg 


Gin 


Leu 


Gin 


Gin 


Lys 


His 


Asn 


Lys 


Asp 


Phe 










180 










185 










190 






45 


Asp 


Lys 


Thr 


Gly 


Glu 


Asn 


Trp 


Asn 


Asp 


Ser 


Thr 


Leu 


Ser 


Arg 


Glu 


Lys 








195 










200 










205 










Val 


Glu 


Ala 


Asn 


Met 


Leu 


Asn 


Lys 


Gin 


Val 


Ala 


Thr 


Met 


Arg 


Arg 


Glu 






210 










215 










220 












Lys 


Ala 


Leu 


Ala 


Tyr 


Ala 


Phe 


Ser 


His 


Gin 


Asn 


Thr 


Trp 


Lys 


Asn 


Ser 


50 


225 










230 










235 










240 




Thr 


Lys 


Met 


Gly 


Ser 


Gin 


Thr 


Phe 


Met 


Asp 


Pro 


Asn 


Asn 


Pro 


His 


Trp 












245 










250 










255 






Gly 


Trp 


Ser 


Trp 


Leu 


Glu 


Arg 


Trp 


Met 


Ala 


Ala 


Arg 


Pro 


Asn 


Glu 


Asn 










260 










265 










270 






55 


His 


Ser 


Leu 


Thr 


Pro 


Asp 


Asn 


Ala 


Glu 


Lys 


Asp 


Ser 


Ser 


Ala 


Arg 


Ser 








275 










280 










285 










Val 


Ala 


Ser 


Arg 


Ala 


Met 


Ser 


Glu 


Met 


He 


Pro 


Arg 


Gly 


Lys 


Asn 


Leu 






290 










295 










300 












Ser 


Pro 


Arg 


Gly 


Lys 


Thr 


Pro 


Asn 


Ser 


Arg 


Arg 


Gly 


Ser 


Ser 


Pro 


Arg 


60 


305 










310 










315 










320 




Val 


Arg 


Gin 


Val 


Pro 


Ser 


Glu 


Asp 


Ser 


Asn 


Ser 


He 


Val 


Ser 


Phe 


Gin 












325 










330 










335 





1620 
1680 
1740 
1800 
1860 
1920 
1980 
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c; a r- 


Pin 
Kj± U 


bin 


Pro 


Cy s 


As n 


Arg 


Ar g 


Hie: 
rl 1 S 


Q o r* 


1 IIX. 




PI \7 

biy 


Q o r- 
J JU 


T 1 t=± 

lie 


Pro 




Ocl 


i nr 


Arg 
355 


Asp 


Asp 


Pin 
ol U 


oer 


rfie 

360 


T H -r 






Php 

•Lilt! 


Otrl 

365 


PI n 


Q O T 


v a i 


o 


D -v- 

IT I/O 


370 


lyr 


i/ien 


a 1 n 


Pro 


l nr 

375 


Pin 
bill 


7V 1 


a 1 ^ 


Lys 


A 1 a 

380 


A ttt 
jri-L y 


Al a 
-H.1 ci 




Phn 






Asn 


Leu 


Ser 


Pro 


Leu 


Ser 


Ser 


Pin 


Lys 


Th r- 

i nr 


n l n 
Aia 


Lys 


Lys 


Arg 


lieu 




385 










390 










395 










400 




Ser 


Phe 


Ser 


Gly 


Ser 


Pro 


Lys 


Thr 


Val 


Arg 


Arg 


Phe 


Ser 


Val 


Gly 


He 


10 










405 










410 










415 






Leu 


Gly 


Leu 


Ser 
420 


Phe 


lie 


Gly 


Gin 


He 
425 


Arg 


Arg 


Arg 


Phe 


Thr 
430 


Ser 


Ser 




Ser 


Ser 


Ser 
435 


Ser 


Ser 


He 


Phe 


Val 
440 


Ser 


Val 


Cys 


He 


Leu 
445 


Gly 


Lys 


Thr 



1 5 Leu 



(2) INFORMATION FOR SEQ ID NO: 37 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 4 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

25 (A) NAME /KEY : peptide 

(B) LOCATION: 1..34 4 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025477 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
Met Glu Glu He Ala Ala He Lys He Gin Thr Ala Phe Arg Gly Tyr 
30 1 5 10 15 

Met Ala Arg Arg Ala Leu Arg Ala Leu Arg Gly Leu Val Arg Leu Lys 

20 25 30 

Ser Leu Val Gin Gly Lys Cys Val Arg Arg Gin Ala Thr Ser Thr Leu 
35 AO 4 5 

35 Gin Ser Met Gin Thr Leu Ala Arg Val Gin Tyr Gin He Arg Glu Arg 
50 55 60 

Arg Leu Arg Leu Ser Glu Asp Lys Gin Ala Leu Thr Arg Gin Leu Gin 
65 " 70 75 80 

Gin Lys His Asn Lys Asp Phe Asp Lys Thr Gly Glu Asn Trp Asn Asp 
4 0 8 5 90 95 

Ser Thr Leu Ser Arg Glu Lys Val Glu Ala Asn Met Leu Asn Lys Gin 

100 " 105 110 

Val Ala Thr Met Arg Arg Glu Lys Ala Leu Ala Tyr Ala Phe Ser His 
115 120 125 

4 5 Gin Asn Thr Trp Lys Asn Ser Thr Lys Met Gly Ser Gin Thr Phe Met 
130 ~ ~ 135 140 

Asp Pro Asn Asn Pro His Trp Gly Trp Ser Trp Leu Glu Arg Trp Met 
145 150 155 160 

Ala Ala Arg Pro Asn Glu Asn His Ser Leu Thr Pro Asp Asn Ala Glu 
50 165 170 175 

Lys Asp Ser Ser Ala Arg Ser Val Ala Ser Arg Ala Met Ser Glu Met 

180 185 190 

He Pro Arg Gly Lys Asn Leu Ser Pro Arg Gly Lys Thr Pro Asn Ser 
195 200 205 

55 Arg Arg Gly Ser Ser Pro Arg Val Arg Gin Val Pro Ser Glu Asp Ser 
210 215 220 

Asn Ser He Val Ser Phe Gin Ser Glu Gin Pro Cys Asn Arg Arg His 
225 230 235 240 

Ser Thr Cys Gly Ser He Pro Ser Thr Arg Asp Asp Glu Ser Phe Thr 
60 " 245 250 255 

Ser Ser Phe Ser Gin Ser Val Pro Gly Tyr Met Ala Pro Thr Gin Ala 
260 265 270 
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Ala Lys Ala Arg Ala Arg Phe Ser Asn Leu Ser Pro Leu Ser Ser Glu 

275 280 285 

Lys Thr Ala Lys Lys Arg Leu Ser Phe Ser Gly Ser Pro Lys Thr Val 
290 295 300 

5 Arg Arg Phe Ser Val Gly lie Leu Gly Leu Ser Phe lie Gly Gin lie 
305 310 315 320 

Arg Arg Arg Phe Thr Ser Ser Ser Ser Ser Ser Ser lie Phe Val Ser 

325 330 335 

Val Cys lie Leu Gly Lys Thr Leu 
10 340 

(2) INFORMATION FOR SEQ ID NO: 375: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
20 (B) LOCATION: 1..32 8 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025478 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:375: 
Met Ala Arg Arg Ala Leu Arg Ala Leu Arg Gly Leu Val Arg Leu Lys 
1 5 10 15 

2 5 Ser Leu Val Gin Gly Lys Cys Val Arg Arg Gin Ala Thr Ser Thr Leu 

20 25 30 

Gin Ser Met Gin Thr Leu Ala Arg Val Gin Tyr Gin lie Arg Glu Arg 

35 40 45 

Arg Leu Ara Leu Ser Glu Asp Lys Gin Ala Leu Thr Arg Gin Leu Gin 
30 50 55 60 

Gin Lys His Asn Lys Asp Phe Asp Lys Thr Gly Glu Asn Trp Asn Asp 
65 70 75 80 

Ser Thr Leu Ser Arg Glu Lys Val Glu Ala Asn Met Leu Asn Lys Gin 
85 90 95 

35 Val Ala Thr Met Arg Arg Glu Lys Ala Leu Ala Tyr Ala Phe Ser His 

100 105 110 

Gin Asn Thr Trp Lys Asn Ser Thr Lys Met Gly Ser Gin Thr Phe Met 

115 ~ 120 125 

Asp Pro Asn Asn Pro His Trp Gly Trp Ser Trp Leu Glu Arg Trp Met 
40 130 135 140 

Ala Ala Arg Pro Asn Glu Asn His Ser Leu Thr Pro Asp Asn Ala Glu 
145 150 155 160 

Lys Asp Ser Ser Ala Arg Ser Val Ala Ser Arg Ala Met Ser Glu Met 
165 170 175 

4 5 lie Pro Arg Gly Lys Asn Leu Ser Pro Arg Gly Lys Thr Pro Asn Ser 

180 185 190 

Arg Arg Gly Ser Ser Pro Arg Val Arg Gin Val Pro Ser Glu Asp Ser 

195 200 205 

Asn Ser lie Val Ser Phe Gin Ser Glu Gin Pro Cys Asn Arg Arg His 
50 210 215 220 

Ser Thr Cys Gly Ser lie Pro Ser Thr Arg Asp Asp Glu Ser Phe Thr 
225 230 235 240 

Ser Ser Phe Ser Gin Ser Val Pro Gly Tyr Met Ala Pro Thr Gin Ala 
245 250 255 

55 Ala Lys Ala Arg Ala Arg Phe Ser Asn Leu Ser Pro Leu Ser Ser Glu 

260 265 270 

Lys Thr Ala Lys Lys Arg Leu Ser Phe Ser Gly Ser Pro Lys Thr Val 

275 280 285 

Arg Arg Phe Ser Val Gly lie Leu Gly Leu Ser Phe lie Gly Gin lie 
60 290 295 300 

Arg Arg Arg Phe Thr Ser Ser Ser Ser Ser Ser Ser lie Phe Val Ser 
305 310 315 320 
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Val Cys lie Leu Gly Lys Thr Leu 
325 

(2) INFORMATION FOR SEQ ID NO: 376: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 67 8 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025524 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 



gattttttag ggtttcaagt gaaaagagta atagcgcggc ggagccatgg ttctcaagac 60 

tgagctttgc cgattcagtg gccagaaaat ttaccctggt agagggatca gatttatccg 120 

atcggactct caggtgtttt tgtttctcaa ctccaaatgt aagaggtatt tccacaacaa 180 

gttgaagcca tctaagcttt gctggactgc tatgtaccga aagcagcaca agaaggacgc 240 

agcacaagag gctgtgaaga gaaggagacg tgcaactaag aagccttact caaggtcgat 300 

tgtcggtgct actttggagg ttattcagaa gaagcgagca gagaagcctg aagttcgrga 360 

tgccgctaga gaagctgccc tacgtgagat caaggagaga atcaagaaga ccaaggacga 420 

gaagaaggca aagaaggtcg agtatgcatc aaagcaacag aagtcacaag tgaagggaaa 480 

tatccccaag agtgctgcac ccaaggctgc taagatgggt ggtggtggag gcagacgttg 540 

aatggagcta tagagtagcc cactcttctc tcttcactta tctttctttc ttgttttgac 600 

attgttttgt tttgtcagcc attttttagt tttgcaccag atctaatata ttcagtttat 660 



gaaaactttt tgtttggc 

(2) INFORMATION FOR SEQ ID NO: 377: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..164 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025525 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



Met 


Val 


Leu 


Lys 


Thr 


Glu 


Leu 


Cys 


Arg 


Phe 


Ser 


Gly 


Gin 


Lys 


He 


Tyr 


1 








5 










10 










15 




Pro 


Gly 


Arg 


Gly 


He 


Arg 


Phe 


He 


Arg 


Ser 


Asp 


Ser 


Gin 


Val 


Phe 


Leu 








20 










25 










30 






Phe 


Leu 


Asn 


Ser 


Lys 


Cys 


Lys 


Arg 


Tyr 


Phe 


His 


Asn 


Lys 


Leu 


Lys 


Pro 






35 










40 










45 








Ser 


Lys 


Leu 


Cys 


Trp 


Thr 


Ala 


Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 




50 










55 










60 










Ala 


Ala 


Gin 


Glu 


Ala 


Val 


Lys 


Arg 


Arg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 


65 










70 










75 










80 


Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 


Ala 


Thr 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Lys 










85 










90 










95 




Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 


Arg 


Asp 


Ala 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 








100 










105 










110 






Arg 


Glu 


He 


Lys 


Glu 


Arg 


He 


Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 






115 










120 










125 








Lys 


Lys 


Val 


Glu 


Tyr 


Ala 


Ser 


Lys 


Gin 


Gin 


Lys 


Ser 


Gin 


Val 


Lys 


Gly 




130 










135 










140 










Asn 


He 


Pro 


Lys 


Ser 


Ala 


Ala 


Pro 


Lys 


Ala 


Ala 


Lys 


Met 


Gly 


Gly 


Gly 


145 










150 










155 










160 


Gly 


Gly 


Arg 


Arg 


























(2) 


INFORMATION 


FOR 


SEQ 


ID 1 


NO: 378 : 

















(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 109 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025526 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

Met Tyr Arg Lys Gin His Lys Lys Asp Ala Ala Gin Glu Ala Val Lys 
1 5 10 15 

Arg Arg Arg Arg Ala Thr Lys Lys Pro Tyr Ser Arg Ser lie Val Gly 
20 25 30 

15 Ala Thr Leu Glu Val lie Gin Lys Lys Arg Ala Glu Lys Pro Glu Val 
35 40 45 

Arg Asp Ala Ala Arg Glu Ala Ala Leu Arg Glu lie Lys Glu Arg lie 

50 ^ 55 60 

Lys Lys Thr Lys Asp Glu Lys Lys Ala Lys Lys Val Glu Tyr Ala Ser 
20 65 ' 70 ~ 75 80 

Lys Gin Gin Lys Ser Gin Val Lys Gly Asn lie Pro Lys Ser Ala Ala 

8 5 90 ' 95 

Pro Lys Ala Ala Lys Met Gly Gly Gly Gly Gly Arg Arg 
100 105 
25 (2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..1623 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 2025544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:379: 
atgggcattt ctaagatgat tcttctgaaa ttcacatatg ttgtttcttt ctggattact 60 
tgcgtgatct tcaaaatgtc aagaacgata aaacgagctt tgttgggctc attcatcttc 120 
attcttgctt ccgctagtgt tgtggtagcc gccattgtag gagccatcga aggtcacacc 180 

4 0 actgacatcg gatttcttca aggtagtgtg cttggagtgg tggctggagt tatcactgcg 240 
gtccaactct ttggaccggt gctacatagt gatcaacctt tgtctaaggt ggctttactc 300 
aggagagtag tgaatgggaa agccattatg ggattggtta gaccttttgt tctcaaagca 360 
tatcaatggc aaataatagc attggataca agttacatgg agagttcaaa cttatacgat 420 
ttcaatcatg aaaaaaaggg actatcgaag agctctattc agaacatccc gatgttctac 480 

4 5 aaccgttcag aacatcaaac aaaatcgagt tgctcgattt gcttacagga ttgggaagaa 540 
ggggaagtag gaagaaagct agcaagatgt ggccacacat ttcatatgaa ttgcatagat 600 
gagtggttgc ttagacaaga aacttgcccc atttgaagaa gacaaaagaa gaagcacgag 660 
aatcatctgc ttaaaatcca atcgcatcga ctcagtcttc ttcttcttct tcacctacct 720 
ttctctctct ctctcgtggt ccattcgcta ttctatactt cttgtcggcg ccggagagag 780 

50 tgagatgact tggtcggttt tcagatccat aaatactcca acactcgacc tctccaccgc 840 
acttcgctcc actcgtaccc cattggtcgc tgctggtgtc ggctgcgcaa cattcgctgg 900 
tgtttctctc ttcagaatgt cttctagatc tcctcctttc gcttccctct ctgtctctgc 960 
ttcttctgtg aagaaggaag ttgtgtctac tgagaaagca ccagctgctt tgggacctta 1020 
ctctcaggcc attaaagcca ataatctggt ttttctttca ggtgttcttg gacttatacc 1080 

55 tgagactgga aagtttgttt cggagagcgt cgaagatcag actgagcagg tactcaaaaa 1140 
catgggggag atattgaaag ctagtggtgc tgattattcc tcggtggtga agacaacaat 1200 
catgttggct gatttggctg acttcaagac agtgaacgag atatatgcca aatacttccc 1260 
agctccttct ccagcacgat cgacgtatca agttgcagct ttgcctctaa acgccaagat 1320 
cgagattgaa tgtattgcaa cactctagaa cactcaaatc aatctctacg aagggtattt 1380 

60 tggtccggaa gatgtcaaaa caaataagag aaacaatggt ttttggggct taacccatat 1440 
ttaacctact tttcattttg ctgcactacg aaaaatgttg cggattatca accattaggt 1500 
tggttacaaa tcaaaagcct ctaatcaaga ataagttatt tagtgtcttg tccaccctca 1560 
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tttgctgttg acgtatatgt attgattgta tacattattt tatttgagtg aaaccattat 1620 
get 

(2) INFORMATION FOR SEQ ID NO: 380: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. .24 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025545 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
15 Met Ser Gly Cys Leu Asp Lys Lys Leu Ala Pro Phe Glu Glu Asp Lys 
15 10 15 

Arg Arg Ser Thr Arg lie lie Cys Leu Lys Ser Asn Arg lie Asp Ser 

20 25 30 

Val Phe Phe Phe Phe Phe Thr Tyr Leu Ser Leu Ser Leu Ser Trp Ser 
20 35 40 45 

lie Arg Tyr Ser lie Leu Leu Val Gly Ala Gly Glu Ser Glu Met Thr 

50 55 60 

Trp Ser Val Phe Arg Ser lie Asn Thr Pro Thr Leu Asp Leu Ser Thr 
65 7 0 7 5 8 0 

2 5 Ala Leu Arg Ser Thr Arg Thr Pro Leu Val Ala Ala Gly Val Gly Cys 

85 90 95 

Ala Thr Phe Ala Gly Val Ser Leu Phe Arg Met Ser Ser Arg Ser Pro 

100 105 110 

Pro Phe Ala Ser Leu Ser Val Ser Ala Ser Ser Val Lys Lys Glu Val 
30 115 120 125 

Val Ser Thr Glu Lys Ala Pro Ala Ala Leu Gly Pro Tyr Ser Gin Ala 

130 135 140 

lie Lys Ala Asn Asn Leu Val Phe Leu Ser Gly Val Leu Gly Leu lie 
145 150 155 160 

35 Pro Glu Thr Gly Lys Phe Val Ser Glu Ser Val Glu Asp Gin Thr Glu 

165 170 175 

Gin Val Leu Lys Asn Met Gly Glu lie Leu Lys Ala Ser Gly Ala Asp 

180 185 190 

Tyr Ser Ser Val Val Lys Thr Thr lie Met Leu Ala Asp Leu Ala Asp 
40 195 200 205 

Phe Lys Thr Val Asn Glu lie Tyr Ala Lys Tyr Phe Pro Ala Pro Ser 

210 215 220 

Pro Ala Arg Ser Thr Tyr Gin Val Ala Ala Leu Pro Leu Asn Ala Lys 
225 230 235 240 

4 5 He Glu He Glu Cys He Ala Thr Leu 

245 

(2) INFORMATION FOR SEQ ID NO: 381: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

55 (A) NAME/KEY: peptide 

(B) LOCATION: 1..211 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025546 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 
Met Gly He Ser Lys Met He Leu Leu Lys Phe Thr Tyr Val Val Ser 
60 1 5 10 15 

Phe Trp He Thr Cys Val He Phe Lys Met Ser Arg Thr He Lys Arg 
20 25 30 
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Ala Leu Leu Gly Ser Phe lie Phe lie Leu Ala Ser Ala Ser Val Val 

35 40 45 

Val Ala Ala lie Val Gly Ala lie Glu Gly His Thr Thr Asp lie Gly 
50 55 60 

5 Phe Leu Gin Gly Ser Val Leu Gly Val Val Ala Gly Val lie Thr Ala 
65 70 75 80 

Val Gin Leu Phe Gly Pro Val Leu His Ser Asp Gin Pro Leu Ser Lys 

85 90 95 

Val Ala Leu Leu Arg Arg Val Val Asn Gly Lys Ala He Met Gly Leu 
10 100 105 110 

Val Arg Pro Phe Val Leu Lys Ala Tyr Gin Trp Gin He He Ala Leu 

115 120 125 

Asp Thr Ser Tyr Met Glu Ser Ser Asn Leu Tyr Asp Phe Asn His Glu 
130 135 140 

15 Lys Lys Gly Leu Ser Lys Ser Ser He Gin Asn He Pro Met Phe Tyr 
145 150 155 160 

Asn Arg Ser Glu His Gin Thr Lys Ser Ser Cys Ser He Cys Leu Gin 

165 170 175 

Asp Trp Glu Glu Gly Glu Val Gly Arg Lys Leu Ala Arg Cys Gly His 
20 180 185 190 

Thr Phe His Met Asn Cys He Asp Glu Trp Leu Leu Arg Gin Glu Thr 

195 200 205 

Cys Pro He 
210 

25 (2) INFORMATION FOR SEQ ID NO:382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 206 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 2025547 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 
Met He Leu Leu Lys Phe Thr Tyr Val Val Ser Phe Trp He Thr Cys 
15 10 15 

Val He Phe Lys Met Ser Arg Thr He Lys Arg Ala Leu Leu Gly Ser 
40 20 25 30 

Phe He Phe He Leu Ala Ser Ala Ser Val Val Val Ala Ala He Val 

35 40 45 

Gly Ala He Glu Gly His Thr Thr Asp He Gly Phe Leu Gin Gly Ser 
50 55 60 

4 5 Val Leu Gly Val Val Ala Gly Val He Thr Ala Val Gin Leu Phe Gly 
65 70 75 80 

Pro Val Leu His Ser Asp Gin Pro Leu Ser Lys Val Ala Leu Leu Arg 

85 90 95 

Arg Val Val Asn Gly Lys Ala He Met Gly Leu Val Arg Pro Phe Val 
50 " 100 105 110 

Leu Lys Ala Tyr Gin Trp Gin He lie Ala Leu Asp Thr Ser Tyr Met 

115 120 125 

Glu Ser Ser Asn Leu Tyr Asp Phe Asn His Glu Lys Lys Gly Leu Ser 
130 135 140 

55 Lys Ser Ser lie Gin Asn lie Pro Met Phe Tyr Asn Arg Ser Glu His 
145 150 155 160 

Gin Thr Lys Ser Ser Cys Ser lie Cys Leu Gin Asp Trp Glu Glu Gly 

165 " 170 175 

Glu Val Gly Arg Lys Leu Ala Arg Cys Gly His Thr Phe His Met Asn 
60 180 185 190 

Cys lie Asp Glu Trp Leu Leu Arg Gin Glu Thr Cys Pro lie 
195 200 205 
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(A) NAME /KEY : peptide 

(B) LOCATION: 1..135 

(D) OTHER INFORMATION: / Ceres Seq. ID 2026209 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
5 Met Glu Pro Thr Glu Lys Ser Met Leu Leu Glu Thr Thr Ser Thr Thr 
1 5 J 10 15 

Lys Met Glu Thr Lys Tyr Glu Asp Met Leu Pro Val Met Ala Glu Lys 

20 2 5 30 

Met Asp Val Glu Glu Phe Val Ser Glu Leu Cys Lys Gly Phe Ser Leu 
10 1 35 40 45~ 

Leu Ala Asp Pro Glu Arg His Leu lie Thr Ala Glu Ser Leu Arg Arg 

5 0 5 5 60 

Asn Ser Gly lie Leu Gly lie Glu Gly Met Ser Lys Glu Asp Ala Gin 
65 70 75 80 

15 Gly Met Val Arg Glu Gly Asp Leu Asp Gly Asp Gly Ala Leu Asn Gin 

85 90 95 

Thr Glu Phe Cys Val Leu Met Val Arg Leu Ser Pro Glu Met Met Glu 

100 105 110 

Asp Ala Glu Thr Trp Leu Glu Lys Ala Leu Thr Gin Glu Leu Cys Asn 
20 115 120 125 

His Asn Leu Ser Ser Met Pro 
130 135 
(2) INFORMATION FOR SEQ ID NO: 38 6: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

30 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..128 

(D) OTHER INFORMATION: / Ceres Seq. ID 20^210 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 
35 Met Leu Leu Glu Thr Thr Ser Thr Thr Lys Met Glu Thr Lys Tyr Glu 
15 10 15 

Asd Met Leu Pro Val Met Ala Glu Lys Met Asp Val Glu Glu Phe Val 

20 25 30 

Ser Glu Leu Cys Lys Gly Phe Ser Leu Leu Ala Asp Pro Glu Arg His 
4 0 35 4 0 4 5 

Leu lie Thr Ala Glu Ser Leu Arg Arg Asn Ser Gly lie Leu Gly lie 

50 55 60 

Glu Gly Met Ser Lys Glu Asp Ala Gin Gly Met Val Arg Glu Gly Asp 
65 70 75 80 

4 5 Leu Asp Gly Asp Gly Ala Leu Asn Gin Thr Glu Phe Cys Val Leu Met 

85 90 95 

Val Arg Leu Ser Pro Glu Met Met Glu Asp Ala Glu Thr Trp Leu Glu 

100 105 110 

Lys Ala Leu Thr Gin Glu Leu Cys Asn His Asn Leu Ser Ser Met Pro 
50 115 120 125 

(2) INFORMATION FOR SEQ ID NO: 387: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
60 (B) LOCATION: 1..552 

(D) OTHER INFORMATION: / Ceres Seq. ID 2026982 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 
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atgaagtata aaaacgtttt tcttaaatta tggatgtctt ctctttatat agtgcccact 60 
tcctcattag cattctacag tctgcacact tatcacttct ttcgaaatct ctctctatcc 120 
ctctcaaaaa tgaagctctc tgtgcgtttt atctccgctg ctcttctctt gttcatggta 180 
ttcattgcca cagggatggg tccagtcacc gtggaggcac gcacgtgtga gtcaaaaagc 240 
5 cataggttca agggtccatg tgtgagcaca cacaactgtg caaacgtgtg ccacaacgaa 300 
ggcttcggcg gaggtaaatg ccgtggattc cgtcgtcgtt gctactgcac aagacactgc 360 
tgatccatcc attctcatga ctcaaatctt cgatccatcg tcagtgtgtt acttctttct 420 
tatctaaatc ttccgtacgg taccatgtcg taccgtacat gagtgttttc tcgaataagt 480 
cattggtttg tgtgtttccg gttttaatgt aatgttaaat caattaatgg cttttaatat 540 
10 attgtattat gg 

(2) INFORMATION FOR SEQ ID NO: 38 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
20 (B) LOCATION: 1..120 

(D) OTHER INFORMATION: / Ceres Seq. ID 2026983 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 





Met 


Lys 


Tyr 


Lys 


Asn 


Val 


Phe 


Leu 


Lys 


Leu 


Trp 


Met 


Ser 


Ser 


Leu 


Tyr 




1 








5 










10 










15 




25 


He 


Val 


Pro 


Thr 


Ser 


Ser 


Leu 


Ala 


Phe 


Tyr 


Ser 


Leu 


His 


Thr 


Tyr 


His 










20 










25 










30 








Phe 


Phe 


Arg 


Asn 


Leu 


Ser 


Leu 


Ser 


Leu 


Ser 


Lys 


Met 


Lys 


Leu 


Ser 


Val 








35 










40 










45 










Arg 


Phe 


He 


Ser 


Ala 


Ala 


Leu 


Leu 


Leu 


Phe 


Met 


Val 


Phe 


He 


Ala 


Thr 


30 




50 










55 










60 












Gly 


Met 


Gly 


Pro 


Val 


Thr 


Val 


Glu 


Ala 


Arg 


Thr 


Cys 


Glu 


Ser 


Lys 


Ser 




65 










70 










75 










80 




His 


Arg 


Phe 


Lys 


Gly 


Pro 


Cys 


Val 


Ser 


Thr 


His 


At - 


Cys 


Ala 


Asn 


Val 












85 










90 










95 




35 


Cys 


His 


Asn 


Glu 


Gly 


Phe 


Gly 


Gly 


Gly 


Lys 


Cys 


Arg 


Gly 


Phe 


Arg 


Arg 










100 










105 










110 








Arg 


Cys 


Tyr 


Cys 


Thr 


Arg 


His 


Cys 
























115 










120 




















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:3E 


39: 

















4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
45 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..10 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2026984 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

Met Ser Ser Leu Tyr lie Val Pro Thr Ser Ser Leu Ala Phe Tyr Ser 
15 10 15 

Leu His Thr Tyr His Phe Phe Arg Asn Leu Ser Leu Ser Leu Ser Lys 
20 25 30 

5 5 Met Lys Leu Ser Val Arg Phe He Ser Ala Ala Leu Leu Leu Phe Met 

35 40 45 

Val Phe He Ala Thr Gly Met Gly Pro Val Thr Val Glu Ala Arg Thr 

50 55 60 

Cys Glu Ser Lvs Ser His Arg Phe Lys Gly Pro Cys Val Ser Thr His 
60 65 70 75 80 

Asn Cys Ala Asn Val Cys His Asn Glu Gly Phe Gly Gly Gly Lys Cys 
8 5 90 95 



WO 00/40695 



PCT/US00/00466 



517 

Arg Gly Phe Arg Arg Arg Cys Tyr Cys Thr Arg His Cys 

100 105 
(2) INFORMATION FOR SEQ ID NO: 390: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 77 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .77 

(D) OTHER INFORMATION: / Ceres Seq. ID 2026985 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 
15 Met Lys Leu Ser Val Arg Phe lie Ser Ala Ala Leu Leu Leu Phe Met 
15 10 15 

Val Phe lie Ala Thr Gly Met Gly Pro Val Thr Val Glu Ala Arg Thr 

20 25 30 

Cys Glu Ser Lys Ser His Arg Phe Lys Gly Pro Cys Val Ser Thr His 
2 0 35 40 4 5 

Asn Cys Ala Asn Val Cys His Asn Glu Gly Phe Gly Gly Gly Lys Cys 

50 55 60 

Arg Gly Phe Arg Arg Arg Cys Tyr Cys Thr Arg His Cys 
65 7 0 7 5 

25 (2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..66 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 2026986 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391: 



40 



4 5 His Val 
65 

(2) INFORMATION FOR SEQ ID NO: 3 92: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1760 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
55 (A) NAME/KEY: - 

(B) LOCATION: 1 . . 17 60 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027300 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 
ctcgtttcat gagagaaaca aaaaatctta tctttctgac aacggtggtg gtgtgtgatg 60 
60 ttgtcagcca tggcaaagac agctacaacg atctctctct cttttcttct tcttctgtgt 120 
ctgtctttaa atagagtctc tttgaaacga agcagctttg tcctgttctg atatccatgg 180 
ctttaccttc gtgcttaaag actggagctt taatgtctcc ggccactgga tttaattttt 240 



Met 


Asp 


Val 


Phe 


Ser 


Leu 


Tyr 


Ser 


Ala 


His 


Phe 


Leu 


He 


Ser 


He 


Leu 


1 








5 










10 










15 




Gin 


Ser 


Ala 


His 
20 


Leu 


Ser 


Leu 


Leu 


Ser 
25 


Lys 


Ser 


Leu 


Ser 


He 
30 


Pro 


Leu 


Lys 


Asn 


Glu 
35 


Ala 


Leu 


Cys 


Ala 


Phe 
40 


Tyr 


Leu 


Arg 


Cys 


Ser 
45 


Ser 


Leu 


Val 


His 


Gly 
50 


He 


His 


Cys 


His 


Arg 
55 


Asp 


Gly 


Ser 


Ser 


His 
60 


Arg 


Gly 


Gly 


Thr 
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ccggcagtct tatgaagtca gacagcggct tcgccgttcc gacaaagtta cagagtactc 300 
gaaaaggtga tcgagagaga ttgagagtcc aagctatttt cagtttccct cctgcgtttc 360 
tgacaagaaa cggtcgagct gagaaacaga aacagctcaa acaagaactt cttgaagcca 420 
ttgagcctct tgaacgtggt gctacggcct cgcctgatga ccagcttcgg attgatcagt 480 
5 tagcgcgtaa agtggaagca gttaatccca ccaaggagcc tttgaagtct gatttggtca 540 
atggaaaatg ggagctcatt tatacaacct ctgcttcgat tttgcaggca aagaaaccaa 600 
ggttcttaag atcaataacc aactaccaat ctatcaatgt ggatacactt aaggtgcaaa 660 
acatggagac ttggcctttc tataactcgg taactggaga cataaaaccc ctcaattcga 720 
agaaggttgc tg^.gaaactc caagtgttta aaattctcgg atttattcct ataaaagcac 780 

10 ctgatagcgc ccgcggtgaa cttgagatta cctatgtgga cgaggaacta cggttatcaa 840 
gagatctttc attcatgtct catgtctggt tccatgacct ttctacattg tttctgttga 900 
tttatgaatc ttcaataaca ctacaagtta aaccaattag aagtgagttg gttatggtcc 960 
ggtacaaaga tcacaaccgg agaaggattg aagagtggtc aaagcgatgg tcagagttgg 1020 
ttggttccgt cgaaacagga agaaagagtc tctcagaaac gacccagcca ctctttaaaa 1080 

15 ctccattgga aaagctttgt cctgttttga tatccatggc tttaccttgg tgcttaaaga 1140 
ctggagtttt aacgtctccg gccgccggat ttaatcatcc atcggacagt ggcttcgccg 1200 
ttccgacgaa gttactgagt attcgaaaag gtgatcggga aagattgaga atccaagctg 1260 
ttttcagctt tcctccaaga aacggtggag ccgagaagcg gaaacagctc aaacatgaac 1320 
ttgtcgaagc cattgagcct cttgaacgtg gtgccactgc ctcgcctga.t gatcagcttc 1380 

20 tgattgatca gttagcacgt aaggtggaag cagtaaaccc aaccaaggag cctctgaagt 1440 
ctgatttgat caatgggaaa tgggagctca tttacacaac atctgctgcg attttgcaag 1500 
caaagaaacc aaggttctta agatcgttaa ctaactacca atgtatcaat atggatacac 1560 
taaaggtgca aagaatggag acttggcctt tctataactc ggtaactgga gacttgacac 1620 
ccctcaactc gaagacggtt gctgtgaaac ttcaagtgtt taaaattctc ggctttattc 1680 

25 cggtaaaagc acctgatggt actgcacgcg gtgaactaga gattacctat gtggacgagg 1740 
aactacgcga caaactttga 
(2) INFORMATION FOR SEQ ID NO: 3 93: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

35 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 527 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027301 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:393: 
Met Ala Leu Pro Ser Cys Leu Lys Thr Gly Ala Leu Met Ser Pro Ala 
40 1 5 10 15 

Thr Gly Phe Asn Phe Ser Gly Ser Leu Met Lys Ser Asp Ser Gly Phe 

2 0 25 30 

Ala Val Pro Thr Lys Leu Gin Ser Thr Arg Lys Gly Asp Arg Glu Arg 
35 40 45 

4 5 Leu Arg Val Gin Ala lie Phe Ser Phe Pro Pro Ala Phe Leu Thr Arg 
50 55 60 

Asn Gly Arg Ala Glu Lys Gin Lys Gin Leu Lys Gin Glu Leu Leu Glu 
65 70 75 80 

Ala lie Glu Pro Leu Glu Arg Gly Ala Thr Ala Ser Pro Asp Asp Gin 
50 85 90 95 

Leu Arg lie Asp Gin Leu Ala Arg Lvs Val Glu Ala Val Asn Pro Thr 

100 105 110 

Lys Glu Pro Leu Lys Ser Asp Leu Val Asn Gly Lys Trp Glu Leu lie 
115 12 0 " 125 

55 Tyr Thr Thr Ser Ala Ser lie Leu Gin Ala Lys Lys Pro Arg Phe Leu 
130 135 140 

Arg Ser lie Thr Asn Tyr Gin Ser lie Asn Val Asp Thr Leu Lys Val 
145 150 155 160 

Gin Asn Met Glu Thr Trp Pro Phe Tyr Asn Ser Val Thr Gly Asp lie 
60 165 170 175 

Lys Pro Leu Asn Ser Lys Lys Val Ala Val Lys Leu Gin Val Phe Lys 
180 185 190 
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lie 


Leu 


Gly 


Phe 


He 


Pro 


He 


Lys 


Ala 


Pro 


Asp 


Ser 


Ala 


Arg 


Gly 


Glu 








195 










2 00 










205 










Leu 


Glu 


He 


Thr 


Tyr 


Val 


Asp 


Glu 


Glu 


Leu 


Arg 


Leu 


Ser 


Arg 


Asp 


Leu 






210 










215 










220 










5 


Ser 


Phe 


Met 


Ser 


His 


Val 


Trp 


Phe 


His 


Asp 


Leu 


Ser 


Thr 


Leu 


Phe 


Leu 




225 










230 










235 










240 




Leu 


He 


Tyr 


Glu 


Ser 


Ser 


He 


Thr 


Leu 


Gin 


Val 


Lys 


Pro 


He 


Arg 


Ser 












245 










250 










255 






Glu 


Leu 


Val 


Met 


Val 


Arg 


Tyr 


Lys 


Asp 


His 


Asn 


Arg 


Arg 


Arg 


He 


Glu 


10 








260 










265 










270 








Glu 


Trp 


Ser 


Lys 


Arg 


Trp 


Ser 


Glu 


Leu 


Val 


Gly 


Ser 


Val 


Glu 


Thr 


Gly 








275 










280 










285 










Arg 


Lys 


Ser 


Leu 


Ser 


Glu 


Thr 


Thr 


Gin 


Pro 


Leu 


Phe 


Lys 


Thr 


Pro 


Leu 






290 










295 










300 










15 


Glu 


Lys 


Leu 


Cys 


Pro 


Val 


Leu 


He 


Ser 


Met 


Ala 


Leu 


Pro 


Trp 


Cys 


Leu 




305 










310 










315 










320 




Lys 


Thr 


Gly 


Val 


Leu 


Thr 


Ser 


Pro 


Ala 


Ala 


Gly 


Phe 


Asn 


His 


Pro 


Ser 












325 










330 










335 






Asp 


Ser 


Gly 


Phe 


Ala 


Val 


Pro 


Thr 


Lys 


Leu 


Leu 


Ser 


He 


Arg 


Lys 


Gly 


20 








340 










345 










350 








Asp 


Arg 


Glu 


Arg 


Leu 


Arg 


He 


Gin 


Ala 


Val 


Phe 


Ser 


Phe 


Pro 


Pro 


Arg 








355 










360 










3 65 










Asn 


Gly 


Gly 


Ala 


Glu 


Lys 


Arg 


Lys 


Gin 


Leu 


Lys 


His 


Glu 


Leu 


Val 


Glu 






370 










375 










380 










25 


Ala 


He 


Glu 


Pro 


Leu 


Glu 


Arg 


Gly 


Ala 


Thr 


Ala 


Ser 


Pro 


Asp 


Asp 


Gin 




385 










390 










395 










400 




Leu 


Leu 


He 


Asp 


Gin 


Leu 


Ala 


Arg 


Lys 


Val 


Glu 


Ala 


Val 


Asn 


Pro 


Thr 












405 










410 










415 






Lys 


Glu 


Pro 


Leu 


Lys 


Ser 


Asp 


Leu 


He 


Asn 


Gly 


Lys 


Trp 


Glu 


Leu 


He 


30 








420 










425 










430 








Tyr 


Thr 


Thr 


Ser 


Ala 


Ala 


He 


Leu 


Gin 


Ala 


Lys 


Lys 


Pro 


Arg 


Phe 


Leu 








435 










440 










445 










Arg 


Ser 


Leu 


Thr 


Asn 


Tyr 


Gin 


Cys 


lie 


Asn 


Met 


Asp 


Thr 


Leu 


Lys 


Val 






450 










455 










4 60 










35 


Gin 


Arg 


Met 


Glu 


Thr 


Trp 


Pro 


Phe 


Tyr 


Asn 


Ser 


Val 


Thr 


Gly 


Asp 


Leu 




465 










470 










475 










480 




Thr 


Pro 


Leu 


Asn 


Ser 


Lys 


Thr 


Val 


Ala 


Val 


Lys 


Leu 


Gin 


Val 


Phe 


Lys 












485 










490 










495 






He 


Leu 


Gly 


Phe 


He 


Pro 


Val 


Lys 


Ala 


Pro 


Asp 


Gly 


Thr 


Ala 


Arg 


Gly 


40 








500 










505 










510 








Glu 


Leu 


Glu 


He 


Thr 


Tyr 


Val 


Asp 


Glu 


Glu 


Leu 


Arg 


Asp 


Lys 


Leu 





515 520 525 



(2) INFORMATION FOR SEQ ID NO: 3 94: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 515 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..515 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027302 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394: 
55 Met Ser Pro Ala Thr Gly Phe Asn Phe Ser Gly Ser Leu Met Lys Ser 
15 10 15 

Asp Ser Gly Phe Ala Val Pro Thr Lys Leu Gin Ser Thr Arg Lys Gly 

20 25 30 

Asp Arg Glu Arg Leu Arg Val Gin Ala He Phe Ser Phe Pro Pro Ala 
60 35 40 45 

Phe Leu Thr Arg Asn Gly Arg Ala Glu Lys Gin Lys Gin Leu Lys Gin 
50 55 60 
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blU 


Leu 


Leu 


blU 


Hid 


T 1 Q 

lie 


Pin 
blU 


Pro 


Leu 


Lj-LU 


Arg 


biy 


Ala 


i nr 


Ala 


ber 




D O 










/ u 










/ O 










Q Pi 
C5 U 




D r- /-s 

r JO O 


Asp 


Asp 


Kj 111 


Leu 

O O 


7\ V <~r 

ax y 


J. _L 


riSp 




Leu 


Alci 




Lys 


T/a "1 
Val 


Pin 


Ala 

Aia 




Vdl 


Asn 


Pro 


i nx 
100 


Lys 


LjI U 


13 v r> 


L6U 


Lys 
105 




Asp 


Leu 


V ci_L 


Asn 
110 


Kj±y 


Lys 




Trp 


b 1U 


Leu 
115 


i le 


Tyr 


T K T" 

i nr 


i nr 


oer 
120 


ai a 


Ser 


lie 


Leu 


pi n 

b±n 
125 


Aia 


Lys 


Lys 




Pro 


Arg 


rne 


Leu 


Arg 


c; o >~ 
ber 


lie 


i nr 


Asn 


Tyr 


bin 


ber 


lie 


Asn 


val 


Asp 


10 




130 










135 










140 












1 nr 


Leu 


Lys 


val 


bin 


Asn 


Met: 


blU 


i nr 


Trp 


Pro 


rne 


Tyr 


Asn 


ber 


Val 




1 4 O 










IjU 










loo 










1 /"A 

1 D U 




Thr 


bl y 


Asp 


i le 


Lys 
loo 


Pro 


Leu 


Asn 


ber 


Lys 
1 / u 


Lys 


val 


Ala 


Val 


Lys 
1 /o 


Leu 


1 D 


bin 


val 


Pne 


Lys 
180 


lie 


Leu 


bly 


rne 


T "I 

lie 
185 


Pro 


lie 


Lys 


Ala 


Pro 
190 


Asp 


Ser 




Ala 


Arg 


bly 
195 


bill 


Leu 


blU 


T "I 

lie 


i nr 
200 


Tyr 


val 


Asp 


p*" i i -i 

blU 


bl U 

205 


Leu 


Arg 


Leu 




ber 


Arg 


Asp 


Leu 


C -V 

ber 


¥ ne 


ixi et: 


O A K- 

ber 


rllS 


val 


Trp 


rne 


Ml S 


Asp 


Leu 


ber 


20 




210 










215 










220 












l nr 


Leu 


r-) L, „ 

rne 


Leu 


Leu 


lie 


Tyr 


p" i i -i 

blU 


C ^ 

ber 


C -t- 

ber 


lie 


m V-> -w-- 

i nr 


Leu 


bin 


val 


Lys 




o o c 

2 25 










O O pi 

2 J U 










o o c 
2 Jo 










24 0 




Pro 


lie 


Arg 


Ser 


blU 
2 4 O 


Leu 


veil 


Met 


val 


Arg 

OCA 

Z JU 


Tyr 


Lys 


Asp 


His 


Asn 

o c: c 

2oo 


Arg 


2. D 


Arg 


Arg 


lie 


Glu 

260 


blU 


Trp 


Ser 


Lys 


Arg 
265 


Trp 


Ser 


Glu 


Leu 


Val 
270 


Gly 


Ser 




Val 


Glu 


Thr 
275 


Gly 


Arg 


Lys 


Ser 


Leu 
280 


Ser 


Glu 


Thr 


Thr 


Gin 
285 


Pro 


Leu 


Phe 




Lys 


Thr 


Pro 


Leu 


Glu 


Lys 


Leu 


Cys 


Pro 


val 


Leu 


He 


Ser 


Met 


Ala 


Leu 


30 




290 










295 










300 












Pro 


Trp 


Cys 


Leu 


Lys 


Thr 


Gly 


Val 


Leu 


Thr 


Ser 


Pro 


Ala 


Ala 


Gly 


Phe 




3 05 










O 1 Pi 

J 1 U 










J lo 










32 0 




Asn 


His 


Pro 


Ser 


Asp 

o o c 
J2 o 


ber 


P"* "1 mm 

bly 


Phe 


Ala 


IT- 1 

Val 

J JU 


Pro 


Thr 


Lys 


Leu 


Leu 

O "2 c 

o J o 


Ser 


O O 


lie 


Arg 


Lys 


Gly 
340 


Asp 


Arg 


Glu 


Arg 


Leu 
345 


Arg 


He 


Gin 


Ala 


Val 
350 


Phe 


Ser 




Phe 


Pro 


Pro 
355 


Arg 


Asn 


Gly 


Gly 


Ala 
360 


Glu 


Lys 


Arg 


Lys 


bin 
365 


Leu 


Lys 


His 




Glu 


Leu 


val 


Glu 


Ala 


T "1 J-N 

lie 


Glu 


Pro 


Leu 


Glu 


Arg 


Gly 


Ala 


Thr 


Ala 


Ser 


40 




370 










375 










380 












Pro 


Asp 


Asp 


Gin 


Leu 


Leu 


lie 


Asp 


Gin 


Leu 


Ala 


Arg 


Lys 


Val 


Glu 


Ala 




385 










o a n 

3 y o 










O A C 

3 9o 










4 00 




Val 


Asn 


Pro 


Thr 


Lys 
4 05 


Glu 


Pro 


Leu 


Lys 


Ser 

j 1 A 

4 1 U 


Asp 


Leu 


He 


Asn 


Gly 
415 


Lys 


4 O 


Trp 


Glu 


Leu 


lie 
420 


Tyr 


Thr 


Thr 


Ser 


Ala 
425 


Ala 


He 


Leu 


Gin 


Ala 
430 


Lys 


Lys 




Pro 


Arg 


Phe 
435 


Leu 


Arg 


Ser 


Leu 


Thr 
440 


Asn 


Tyr 


Gin 


Cys 


I le 
445 


Asn 


Met 


Asp 




Thr 


Leu 


Lys 


Val 


Gin 


Arg 


Met 


Glu 


Thr 


Trp 


Pro 


Phe 


Tyr 


Asn 


Ser 


Val 


50 




450 










455 










460 












Thr 


Gly 


Asp 


Leu 


Thr 


Pro 


Leu 


Asn 


Ser 


Lys 


Thr 


Val 


Ala 


Val 


Lys 


Leu 




465 










470 










475 










480 




Gin 


Val 


Phe 


Lys 


lie 
485 


Leu 


Gly 


Phe 


He 


Pro 
490 


Val 


Lys 


Ala 


Pro 


Asp 
495 


Gly 


55 


Thr 


Ala 


Arg 


Gly 
500 


Glu 


Leu 


Glu 


lie 


Thr 
505 


Tyr 


Val 


Asp 


Glu 


Glu 
510 


Leu 


Arg 



Asp Lys Leu 
515 

(2) INFORMATION FOR SEQ ID NO: 395: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE : 

5 (A) NAME /KEY : peptide 

(B) LOCATION: 1. .502 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027303 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 
Met Lys Ser Asp Ser Gly Phe Ala Val Pro Thr Lys Leu Gin Ser Thr 
10 1 5 10 15 

Arg Lys Gly Asp Arg Glu Arg Leu Arg Val Gin Ala lie Phe Ser Phe 

20 25 30 

Pro Pro Ala Phe Leu Thr Arg Asn Gly Arg Ala Glu Lys Gin Lys Gin 
35 4 0 4 5 

15 Leu Lys Gin Glu Leu Leu Glu Ala lie Glu Pro Leu Glu Arg Gly Ala 
50 55 60 

Thr Ala Ser Pro Asp Asp Gin Leu Arg lie Asp Gin Leu Ala Arg Lys 
65 7 0 7 5 80 

Val Glu Ala Val Asn Pro Thr Lys Glu Pro Leu Lys Ser Asp Leu Val 
20 85 90 95 

Asn Gly Lys Trp Glu Leu lie Tyr Thr Thr Ser Ala Ser lie Leu Gin 

100 105 110 

Ala Lys Lys Pro Arg Phe Leu Arg Ser lie Thr Asn Tyr Gin Ser lie 
115 120 125 

2 5 Asn Val Asp Thr Leu Lys Val Gin Asn Met Glu Thr Trp Pro Phe Tyr 
130 135 140 

Asn Ser Val Thr Gly Asp lie Lys Pro Leu Asn Ser Lys Lys Val Ala 
145 150 155 160 

Val Lys Leu Gin Val Phe Lys lie Leu Gly Phe lie Pro lie Lys Ala 
30 165 170 175 

Pro Asp Ser Ala Arg Gly Glu Leu Glu lie Thr Tyr Val Asp Glu Glu 

180 185 190 

Leu Arg Leu Ser Arg Asp Leu Ser Phe Met Ser His Val Trp Phe His 
195 200 205 

35 Asp Leu Ser Thr Leu Phe Leu Leu lie Tyr Glu Ser Ser lie Thr Leu 
210 215 220 

Gin Val Lys Pro lie Arg Ser Glu Leu Val Met Val Arg Tyr Lys Asp 
225 ^ 230 235 240 

His Asn Arg Arg Arg lie Glu Glu Trp Ser Lys Arg Trp Ser Glu Leu 
40 245 250 255 

Val Gly Ser Val Glu Thr Gly Arg Lys Ser Leu Ser Glu Thr Thr Gin 

260 265 270 

Pro Leu Phe Lys Thr Pro Leu Glu Lys Leu Cys Pro Val Leu lie Ser 
275 280 285 

4 5 Met Ala Leu Pro Trp Cys Leu Lys Thr Gly Val Leu Thr Ser Pro Ala 
290 295 300 

Ala Gly Phe Asn His Pro Ser Asp Ser Gly Phe Ala Val Pro Thr Lys 
305 310 315 320 

Leu Leu Ser lie Arg Lys Gly Asp Arg Glu Arg Leu Arg lie Gin Ala 
50 325 330 335 

Val Phe Ser Phe Pro Pro Arg Asn Gly Gly Ala Glu Lys Arg Lys Gin 

340 345 350 

Leu Lys His Glu Leu Val Glu Ala lie Glu Pro Leu Glu Arg Gly Ala 
355 360 365 

55 Thr Ala Ser Pro Asp Asp Gin Leu Leu lie Asp Gin Leu Ala Arg Lys 
370 375 380 

Val Glu Ala Val Asn Pro Thr Lys Glu Pro Leu Lys Ser Asp Leu lie 
385 390 395 400 

Asn Gly Lys Trp Glu Leu lie Tyr Thr Thr Ser Ala Ala lie Leu Gin 
60 " 405 410 415 

Ala Lys Lys Pro Arg Phe Leu Arg Ser Leu Thr Asn Tyr Gin Cys lie 
420 425 430 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



522 

Asn Met Asp Thr Leu Lys Val Gin Arg Met Glu Thr Trp Pro Phe Tyr 

4 35 ' 4 40 44 5 

Asn Ser Val Thr Gly Asp Leu Thr Pro Leu Asn Ser Lys Thr Val Ala 

450 455 460 

Val Lys Leu Gin Val Phe Lys lie Leu Gly Phe lie Pro Val Lys Ala 
465 470 475 480 

Pro Asp Gly Thr Ala Arg Gly Glu Leu Glu lie Thr Tyr Val Asp Glu 

485 490 495 

Glu Leu Arg Asp Lys Leu 
50 0 

(2) INFORMATION FOR SEQ ID NO: 396: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 539 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027375 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 
actgtgttcg ataataatgt cgacaggaga agcgatacca agagtcgccg tcgtcgtttt 
cattctcaac ggaaactcaa tcttattagg tcgccgccgt tcctcaatcg gcaactccac 
tttcgctctt cccggtggcc acctcgaatt cggagagagc tttgaagaat gtgcagcgag 
agaagtaatg gaggaaacag gtctaaagat tgaaaagatg aagcttttga ctgttacaaa 
caatgtcttc aaagaagcac caacgccatc acactacgtc tctgtttcga tacgtgcggt 
gttggtggat ccaagtcaag aaccgaagaa tatggaacca gagaagtgtg aaggatggga 
ttggtatgat tgggagaatc taccaaagcc tttgttttgg ccacttgaga aattgtttgg 
aagtggtttc aatcctttca ctcatggtgg tggagactaa tagatgtaag agttaatgat 
tgatttggga ttgaatgttg cacaaattgg gcatttggtc tagtggtatg atctygctt 
(2) INFORMATION FOR SEQ ID NO: 3 97: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE 
FEATURE: 

(A) NAME /KEY 

(B) LOCATION 



peptide 
1. . 152 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027376 
SEQUENCE DESCRIPTION: SEQ ID NO: 397: 



Leu 


Cys 


Ser 


He 


He 


Met 


Ser 


Thr 


Gly 


Glu 


Ala 


He 


Pro 


Arg 


Val 


Ala 


1 








5 










10 










15 




Val 


Val 


Val 


Phe 


He 


Leu 


Asn 


Gly 


Asn 


Ser 


He 


Leu 


Leu 


Gly 


Arg 


Arg 








20 










25 










30 






Arg 


Ser 


Ser 


He 


Gly 


Asn 


Ser 


Thr 


Phe 


Ala 


Leu 


Pro 


Gly 


Gly 


His 


Leu 






35 










40 










45 








Glu 


Phe 


Gly 


Glu 


Ser 


Phe 


Glu 


Glu 


Cys 


Ala 


Ala 


Arg 


Glu 


Val 


Met 


Glu 




50 










55 










60 










Glu 


Thr 


Gly 


Leu 


Lys 


He 


Glu 


Lys 


Met 


Lys 


Leu 


Leu 


Thr 


Val 


Thr 


Asn 


65 










70 










75 










80 


Asn 


Val 


Phe 


Lys 


Glu 


Ala 


Pro 


Thr 


Pro 


Ser 


His 


Tyr 


Val 


Ser 


Val 


Ser 










85 










90 










95 




He 


Arg 


Ala 


Val 


Leu 


Val 


Asp 


Pro 


Ser 


Gin 


Glu 


Pro 


Lys 


Asn 


Met 


Glu 








100 










105 










110 






Pro 


Glu 


Lys 


Cys 


Glu 


Gly 


Trp 


Asp 


Trp 


Tyr 


Asp 


Trp 


Glu 


Asn 


Leu 


Pro 






115 










120 










125 








Lys 


Pro 


Leu 


Phe 


Trp 


Pro 


Leu 


Glu 


Lys 


Leu 


Phe 


Gly 


Ser 


Gly 


Phe 


Asn 




130 










135 










140 










Pro 


Phe 


Thr 


His 


Gly 


Gly 


Gly 


Asp 



















60 
120 
180 
240 
300 
360 
420 
480 
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145 150 
(2) INFORMATION FOR SEQ ID NO: 3 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

10 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 147 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027377 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
Met Ser Thr Gly Glu Ala lie Pro Arg Val Ala Val Val Val Phe lie 
15 1 5 10 15 

Leu Asn Gly Asn Ser lie Leu Leu Gly Arg Arg Arg Ser Ser lie Gly 

20 25 30 

Asn Ser Thr Phe Ala Leu Pro Gly Gly His Leu Glu Phe Gly Glu Ser 
35 40 " 45 

2 0 Phe Glu Glu Cys Ala Ala Arg Glu Val Met Glu Glu Thr Gly Leu Lys 
50 55 60 

lie Glu Lys Met Lys Leu Leu Thr Val Thr Asn Asn Val Phe Lys Glu 
65 " 70 75 80 

Ala Pro Thr Pro Ser His Tyr Val Ser Val Ser lie Arg Ala Val Leu 
25 85 90 95 

Val Asp Pro Ser Gin Glu Pro Lys Asn Met Glu Pro Glu Lys Cys Glu 

100 " 105 110 

Gly Trp Asp Trp Tyr Asp Trp Glu Asn Leu Pro Lys Pro Leu Phe Trp 
115 120 125 

30 Pro Leu Glu Lys Leu Phe Gly Ser Gly Phe Asn Pro Phe Thr His Gly 
130 " 135 ~ 140 

Gly Gly Asp 
145 

(2) INFORMATION FOR SEQ ID NO: 399: 
35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
40 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..90 

(D) OTHER INFORMATION: / Ceres Seq. ID 2027378 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

Met Glu Glu Thr Gly Leu Lys lie Glu Lys Met Lys Leu Leu Thr Val 
1 5 10 15 

Thr Asn Asn Val Phe Lys Glu Ala Pro Thr Pro Ser His Tyr Val Ser 
20 25 30 

50 Val Ser lie Arg Ala Val Leu Val Asp Pro Ser Gin Glu Pro Lys Asn 
35 4 0 4 5 

Met Glu Pro Glu Lys Cys Glu Gly Trp Asp Trp Tyr Asp Trp Glu Asn 

50 55 60 

Leu Pro Lys Pro Leu Phe Trp Pro Leu Glu Lys Leu Phe Gly Ser Gly 
55 65 "* 70 75 80 

Phe Asn Pro Phe Thr His Gly Gly Gly Asp 

85 90 
(2) INFORMATION FOR SEQ ID NO: 400: 
(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 3983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
5 (B) LOCATION: 1..3983 

(D) OTHER INFORMATION: / Ceres Seq. ID 2028729 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

atgaacacga cgccgtttca ctcggatcct ccgccgtcga ggatccagcg taagctcgtt 60 

gtcgaagttg ttgaagctcg taatattctc cctaaagatg gtcaaggaag ctctagcgct 120 

10 nacgtcgttg tcgatttcga tgctcagaag aaacgaacct ccactaagtt ccgtgaccta 180 

aaccctattt ggaacgagat gcttgatttc gccgtctccg atcccaaaaa catggattac 240 

gacgagctcg atatcgagct ttataacgat aaaagatttg gtaacggagg tggccggaag 300 

aatcattttc tcggtagggt taagatctat ggaagccagt tctcgcgaag aggtgaagaa 360 

ggtcttgtgt atttcccttt ggagaagaag agtgtgttca gctggattcg cggcgagatt 420 

15 ggactcaaaa tctactatta cgacgaagcc gccgacgaag acacggcggg tggaggtgga 480 

ggacagcaac aacaacagca acagcaacaa tttcatccgc cgcaacaaga agccgatgaa 540 

caacaacacc agcaacaatt tcatcctccg ccgcagcaga tgatgaatat accaccggag 600 

aaacctaatg tagttgtggt tgaagaaggt agggttttcg aatcggctca gagtcagcgc 660 

tatacagaga cacatcagca acctccggtg gttattgttg aagaatcacc accgcagcat 720 

20 gtaatgcaag gtccaaatga taaccatcct caccgaaatg ataaccatcc tcaacggcca 780 

ccgtctccgc cgccacctcc atcggctggg gaagtacatt attatccacc ggaagtgagg 840 

aagatgcaag taggaagacc tcccggcgga gatagaatta gggttacgaa gagaccaccg 900 

aatggagatt attcacctag ggttatcaat agcaaaactg gaggaggaga gacgacgatg 960 

gagaagaaga ctcatcatcc ttacaatctt gttgagccaa tgcagtatct cttcgttcgg 1020 

25 attgtgaagg cgcgtggctt accacctaac gagagcgcgt atgttaaggt acggacgtcg 1080 

aaccatttcg tcaggtctaa accggccgtt aaccggcccg gcgaatcggt tgattcaccg 1140 

gagtggaatc aggtttttgc tcttggtcat aaccggtctg attccgctgt aactggtgcg 1200 

actcttgaga tctctgcttg ggatgcttcg tcggagagtt ttctcggagg agtttgtttt 1260 

gatctctctg aggttccggt tcgtgacccg ccggatagtc cgcttgctcc tcagtggtat 1320 

30 cggctcgaag gctccggcgc ggatcagaac tctgggagaa tttccggtga cattcagctc 1380 

tctgtttgga ttggtactca ggtagatgag gcatttccgg aggcttggag ctctgatgct 1440 

ccgcatgtag ctcacacgcg ttctaaggtg tstcaatcgc cgaaactttg gtacttgaga 1500 

gtgacggttc ttgaggcaca ggatttacac atagctccta atctcccgcc gttgactgcg 1560 

cctgagattc gtgtgaaagc tcaattaggg tttcagtcgg cgcgtacaag aagaggctca 1620 

35 atgaataacc acagtggttc gtttcattgg catgaggata tgatctttgt tgctggagag 1680 

ccgttggaag attgcttggt tctgatggtg gaagaccgga cgactaaaga agcaacactt 1740 

ctaggacatg ccatgatccc agtgagctcc atcgagcagc gaattgatga gcgttttgtg 1800 

ccgtcgaaat ggcacactct ggaaggagaa ggtggaggtg gaggtggagg aggaggacct 1860 

ggaggtggtg gtggtggtgg accttattgt ggaaggatta gccttagact ttgtctcgaa 1920 

4 0 ggtgggtatc atgtgcttga agaggcggcg catgtatgca gcgatttccg tccgacggct 1980 

aagcagctat ggaaaccgcc gattggaata cttgagttgg ggattcttgg agctcgtggg 2040 

ttgttgccga tgaaggcgaa aaacggaggg aaaggttcca ctgatgctta ttgtgttgct 2100 

aagtacggga agaaatgggt caggactcga accataacag acagttttga cccgaggtgg 2160 

cacgagcagt atacgtggca ggtttatgat ccttgcaccg tgctaactgt tggagtcttc 2220 

4 5 gacaattgga ggatgttctc tgacgcctcc gatgatagac ctgacacacg gattgggaag 2280 

atacggatcc gggtgtcgac gttagagagc aacaaagtgt acaccaattc atatcctctg 2340 

ttggttttgt tacctagcgg tatgaaaaaa atgggtgaaa ttgaagtggc agtccggttt 2400 

gcatgcccgt ctctgctgcc tgatgtttgt gcagcttatg gacagccgct tctgcctcgg 2460 

atgcactaca taaggcctct aggtgtagca caacaagatg cattaagagg ggccgccacg 2520 

50 aaaatggtag cagcttggct ggctcgagca gaaccaccat tgggaccaga ggtagttcga 2580 

tatatgttag atgcagattc gcatgcatgg agcatgagga aaagcaaagc gaattggtac 2640 

agaattgttg gtgttttagc ttgggcagtg ggtttagcta agtggttgga taatatcagg 2700 

cggtggagga atccagtgac gacggtgcta gtccatattc tatatctggt tcttgtttgg 2760 

taccctgatt tggtagtccc gactgcattc ttgtacgtgg tgatgatcgg agtttggtac 2820 

55 taccggttta gacccaagat accggctggt atggatatcc gcttatcaca agctgaaacc 2880 

gtcgatcctg atgagctaga tgaagaattc gacaccatac caagctcaag gcgaccagaa 2940 

gtaatccgag ctaggtacga ccgattaagg atcttagcag tgagggttca gaccattcta 3000 

ggagattttg cagcgcaagg agaacggatt caagcgttgg ttagctggag agatccgaga 3060 

gcgacaaagc tgttcatagc aatctgtttg gtaatcacaa tagttctgta tgcagttcct 3120 

60 gcgaaaatgg tggcggtggc tctaggagtt tcagatagtg tgccaacggc gaaacaagac 3180 

acgaaagaat ctctaaagaa atcattttca tcgcttcgct tcgatttttc ttcaatggct 3240 

gtcgtcggcg ctccaatatc gtctccggcg gctcagctgc agacacaatt tctctccaat 3300 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(ii) 
(ix) 



(xi) 



525 

cccattctcc cccgctttcg ccggtctttc tccaccggaa aatcaccagc aactttctcc 
gtcgtagcta tggctcccca gaaaaaggtg aacaaatatg atgccaagtg gaagaaacaa 
tggtacggag ctggattgtt tttcgaaggg agtgagcaaa taaacgttga tgttttcaag 
aagctggaga agcgaaaagt gttgagcaac gttgagaaat ctggcctgct gtcaaaagca 
gaggggttgg gactcacatt gtcatctctt gagaagctta aagtcttctc caaagcagag 
gaccttggtc ttctcagtct ccttgagaac ttagctggaa catcgcctgc ggtcttagcc 
tcggctgcat taccagctct cacggctgct attgtagccg tggtgttgat cccggatgac 
tcaactactc tagtggttgc tcaggcggtt ttggccggtg ctcttgcgct tacaggggtt 
gttttgttgg ttggttctgt tgttttggat ggacttcaag aagctgactg attctttctc 
tgtaaaccga acataaaccc atgtcttgtc caattgattt ttgtcagttg ctgatttata 
gctgtatggt tcagttgttt atggttagtc caagacataa gctgagtgat agaaagaagc 
tttataataa ttaaacaaat att 
(2) INFORMATION FOR SEQ ID NO : 4 0 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1276 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 1276 

(D) OTHER INFORMATION: / Ceres Seq. ID 2028730 
SEQUENCE DESCRIPTION: SEQ ID NO: 401: 
Met Asn Thr Thr Pro Phe His Ser Asp Pro Pro Pro Ser Arg lie Gin 
15 10 15 

Arg Lys Leu Val Val Glu Val Val Glu Ala Arg Asn lie Leu Pro Lys 

2 0 25 30 

Asp Gly Gin Gly Ser Ser Ser Ala Tyr Val Val Val Asp Phe Asp Ala 

~ 35 4 0 4 5 

Gin Lys Lys Arg Thr Ser Thr Lys Phe Arg Asp Leu Asn Pro lie Trp 

5 0 55 6 0 

Asn Glu Met Leu Asp Phe Ala Val Ser Asp Pro Lys Asn Met Asp Tyr 
65 70 75 80 

Asp Glu Leu Asp lie Glu Val Tyr Asn Asp Lys Arg Phe Gly Asn Gly 

85 90 95 

Gly Gly Arg Lys Asn His Phe Leu Gly Arg Val Lys lie Tyr Gly Ser 

100 105 110 

Gin Phe Ser Arg Arg Gly Glu Glu Gly Leu Val Tyr Phe Pro Leu Glu 

115 120 125 

Lys Lys Ser Val Phe Ser Trp lie Arg Gly Glu lie Gly Leu Lys lie 

130 135 140 

Tyr Tyr Tyr Asp Glu Ala Ala Asp Glu Asp Thr Ala Gly Gly Gly Gly 
145 150 155 160 

Gly Gin Gin Gin Gin Gin Gin Gin Gin Gin Phe His Pro Pro Gin Gin 

165 170 175 

Glu Ala Asp Glu Gin Gin His Gin Gin Gin Phe His Pro Pro Pro Gin 

180 185 190 

Gin Met Met Asn lie Pro Pro Glu Lys Pro Asn Val Val Val Val Glu 

195 200 205 

Glu Gly Arg Val Phe Glu Ser Ala Gin Ser Gin Arg Tyr Thr Glu Thr 

210 215 220 

His Gin Gin Pro Pro Val Val lie Val Glu Glu Ser Pro Pro Gin His 
225 230 235 240 

Val Met Gin Gly Pro Asn Asp Asn His Pro His Arg Asn Asp Asn His 

245 ~ 250 ' 255 

Pro Gin Arg Pro Pro Ser Pro Pro Pro Pro Pro Ser Ala Gly Glu Val 

260 265 270 

His Tyr Tyr Pro Pro Glu Val Arg Lys Met Gin Val Gly Arg Pro Pro 

275 280 285 

Gly Gly Asp Arg lie Arg Val Thr Lys Arg Pro Pro Asn Gly Asp Tyr 
290 295 ' 300 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
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Ser 


Pro 


Arg 


Val 


He 


Asn 


Ser 


Lys 


Thr 


Gly 


Gly 


Gly 


Glu 


Thr 


Thr 


Met 




305 










310 










315 










320 




Glu 


Lys 


Lys 


Thr 


His 


His 


Pro 


Tyr 


Asn 


Leu 


Val 


Glu 


Pro 


Met 


Gin 


Tyr 












325 










330 










335 




5 


Leu 


Phe 


Val 


Arg 


He 


Val 


Lys 


Ala 


Arg 


Gly 


Leu 


Pro 


Pro 


Asn 


Glu 


Ser 










340 










345 










350 








Ala 


Tyr 


Val 


Lys 


Val 


Arg 


Thr 


Ser 


Asn 


His 


Phe 


Val 


Arg 


Ser 


Lys 


Pro 








355 










360 










365 










Ala 


Val 


Asn 


Arg 


Pro 


Gly 


Glu 


Ser 


Val 


Asp 


Ser 


Pro 


Glu 


Trp 


Asn 


Gin 


10 




370 










375 










380 












Val 


Phe 


Ala 


Leu 


Gly 


His 


Asn 


Arg 


Ser 


Asp 


Ser 


Ala 


Val 


Thr 


Gly 


Ala 




385 










390 










395 










400 




Thr 


Leu 


Glu 


He 


Ser 


Ala 


Trp 


Asp 


Ala 


Ser 


Ser 


Glu 


Ser 


Phe 


Leu 


Gly 












405 










410 










415 




15 


Gly 


Val 


Cys 


Phe 


Asp 


Leu 


Ser 


Glu 


Val 


Pro 


Val 


Arg 


Asp 


Pro 


Pro 


Asp 










420 










425 










430 








Ser 


Pro 


Leu 


Ala 


Pro 


Gin 


Trp 


Tyr 


Arg 


Leu 


Glu 


Gly 


Ser 


Gly 


Ala 


Asp 








435 










440 










445 










Gin 


Asn 


Ser 


Gly 


Arg 


He 


Ser 


Gly 


Asp 


He 


Gin 


Leu 


Ser 


Val 


Trp 


He 


20 




450 










455 










460 












Gly 


Thr 


Gin 


Val 


Asp 


Glu 


Ala 


Phe 


Pro 


Glu 


Ala 


Trp 


Ser 


Ser 


Asp 


Ala 




465 










470 










475 










480 




Pro 


His 


Val 


Ala 


His 


Thr 


Arg 


Ser 


Lys 


Val 


Tyr 


Gin 


Ser 


Pro 


Lys 


Leu 












485 










490 










495 




25 


Trp 


Tyr 


Leu 


Arg 


Val 


Thr 


Val 


Leu 


Glu 


Ala 


Gin 


Asp 


Leu 


His 


He 


Ala 










500 










505 










510 








Pro 


Asn 


Leu 


Pro 


Pro 


Leu 


Thr 


Ala 


Pro 


Glu 


He 


Arg 


Val 


Lys 


Ala 


Gin 








515 










520 










525 










Leu 


Gly 


Phe 


Gin 


Ser 


Ala 


Arg 


Thr 


Arg 


Arg 


Gly 


Ser 


Met 


Asn 


Asn 


His 


30 




530 










535 










540 












Ser 


Gly 


Ser 


Phe 


His 


Trp 


His 


Glu 


Asp 


Met 


He 


Phe 


Val 


Ala 


Gly 


Glu 




545 










550 










555 










560 




Pro 


Leu 


Glu 


Asp 


Cys 


Leu 


Val 


Leu 


Met 


Val 


Glu 


Asp 


Arg 


Thr 


Thr 


Lys 












565 










570 










575 




35 


Glu 


Ala 


Thr 


Leu 


Leu 


Gly 


His 


Ala 


Met 


He 


Pro 


Val 


Ser 


Ser 


He 


Glu 










580 










585 










590 








Gin 


Arg 


He 


Asp 


Glu 


Arg 


Phe 


Val 


Pro 


Ser 


Lys 


Trp 


His 


Thr 


Leu 


Glu 








595 










600 










605 










Gly 


Glu 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Pro 


Gly 


Gly 


Gly 


Gly 


40 




610 










615 










620 












Gly 


Gly 


Gly 


Pro 


Tyr 


Cys 


Gly 


Arg 


He 


Ser 


Leu 


Arg 


Leu 


Cys 


Leu 


Glu 




625 










630 










635 










640 




Gly 


Gly 


Tyr 


His 


Val 


Leu 


Glu 


Glu 


Ala 


Ala 


His 


Val 


Cys 


Ser 


Asp 


Phe 












645 










650 










655 




45 


Arg 


Pro 


Thr 


Ala 


Lys 


Gin 


Leu 


Trp 


Lys 


Pro 


Pro 


He 


Gly 


He 


Leu 


Glu 










660 










665 










670 








Leu 


Gly 


He 


Leu 


Gly 


Ala 


Arg 


Gly 


Leu 


Leu 


Pro 


Met 


Lys 


Ala 


Lys 


Asn 








675 










680 










685 










Gly 


Gly 


Lys 


Gly 


Ser 


Thr 


Asp 


Ala 


Tyr 


Cys 


Val 


Ala 


Lys 


Tyr 


Gly 


Lys 


50 




690 










695 










700 












Lys 


Trp 


Val 


Arg 


Thr 


Arg 


Thr 


He 


Thr 


Asp 


Ser 


Phe 


Asp 


Pro 


Arg 


Trp 




705 










710 










715 










720 




His 


Glu 


Gin 


Tyr 


Thr 


Trp 


Gin 


Val 


Tyr 


Asp 


Pro 


Cys 


Thr 


Val 


Leu 


Thr 












725 










730 










735 




55 


Val 


Gly 


Val 


Phe 


Asp 


Asn 


Trp 


Arg 


Met 


Phe 


Ser 


Asp 


Ala 


Ser 


Asp 


Asp 










740 










745 










750 








Arg 


Pro 


Asp 


Thr 


Arg 


He 


Gly 


Lys 


He 


Arg 


He 


Arg 


Val 


Ser 


Thr 


Leu 








755 










760 










765 










Glu 


Ser 


Asn 


Lys 


Val 


Tyr 


Thr 


Asn 


Ser 


Tyr 


Pro 


Leu 


Leu 


Val 


Leu 


Leu 


60 




770 










775 










780 












Pro 


Ser 


Gly 


Met 


Lys 


Lys 


Met 


Gly 


Glu 


He 


Glu 


Val 


Ala 


Val 


Arg 


Phe 



785 790 795 800 
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Ala Cys Pro Ser Leu Leu Pro Asp Val Cys Ala Ala Tyr Gly Gin Pro 

805 810 815 

Leu Leu Pro Arg Met His Tyr lie Arg Pro Leu Gly Val Ala Gin Gin 
820 825 830 

5 Asp Ala Leu Arg Gly Ala Ala Thr Lys Met Val Ala Ala Trp Leu Ala 
835 840 845 

Arg Ala Glu Pro Pro Leu Gly Pro Glu Val Val Arg Tyr Met Leu Asp 

8 50 8 55 8 60 

Ala Asp Ser His Ala Trp Ser Met Arg Lys Ser Lys Ala Asn Trp Tyr 
10 865 870 875 880 

Arg lie Val Gly Val Leu Ala Trp Ala Val Gly Leu Ala Lys Trp Leu 

885 890 895 

Asp Asn lie Arg Arg Trp Arg Asn Pro Val Thr Thr Val Leu Val His 
900 905 910 

15 lie Leu Tyr Leu Val Leu Val Trp Tyr Pro Asp Leu Val Val Pro Thr 
915 920 925 

Ala Phe Leu Tyr Val Val Met lie Gly Val Trp Tyr Tyr Arg Phe Arc* 

930 935 940 

Pro Lys lie Pro Ala Gly Met Asp lie Arg Leu Ser Gin Ala Glu Thr 
20 945 950 955 960 

Val Asp Pro Asp Glu Leu Asp Glu Glu Phe Asp Thr lie Pro Ser Ser 

965 970 975 

Arg Arg Pro Glu Val lie Arg Ala Arg Tyr Asp Arg Leu Arg lie Leu 
980 985 990 

25 Ala Val Arg Val Gin Thr lie Leu Gly Asp Phe Ala Ala Gin Gly Glu 
995 1000 1005 

Arg lie Gin Ala Leu Val Ser Trp Arg Asp Pro Arg Ala Thr Lys Leu 

1010 1015 1020 

Phe lie Ala lie Cys Leu Val lie Thr lie Val Leu Tyr Ala Val Pro 
30 1025 1030 1035 1040 

Ala Lys Met Val Ala Val Ala Leu Gly Val Ser Asp Ser Val Pro Thr 

1045 1050 1055 

Ala Lys Gin Asp Thr Lys Glu Ser Leu Lys Lys Ser Phe Ser Ser Leu 
1060 1065 1070 

3 5 Arg Phe Asp Phe Ser Ser Met Ala Val Val Gly Ala Pro lie Ser Ser 

1075 1080 1085 

Pro Ala Ala Gin Leu Gin Thr Gin Phe Leu Ser Asn Pro lie Leu Pro 

1090 1095 1100 

Arg Phe Arg Arg Ser Phe Ser Thr Gly Lys Ser Pro Ala Thr Phe Ser 
40 1105 " 1110 1115 1120 

Val Val Ala Met Ala Pro Gin Lys Lys Val Asn Lys Tyr Asp Ala Lys 

1125 1130 1135 

Trp Lys Lys Gin Trp Tyr Gly Ala Gly Leu Phe Phe Glu Gly Ser Glu 
1140 1145 1150 

4 5 Gin lie Asn Val Asp Val Phe Lys Lys Leu Glu Lys Arg Lys Val Leu 

1155 1160 1165 

Ser Asn Val Glu Lys Ser Gly Leu Leu Ser Lys Ala Glu Gly Leu Gly 

1170 1175 1180 

Leu Thr Leu Ser Ser Leu Glu Lys Leu Lys Val Phe Ser Lys Ala Glu 
50 1185 1190 1195 1200 

Asp Leu Gly Leu Leu Ser Leu Leu Glu Asn Leu Ala Gly Thr Ser Pro 

1205 1210 1215 

Ala Val Leu Ala Ser Ala Ala Leu Pro Ala Leu Thr Ala Ala lie Val 
1220 1225 1230 

55 Ala Val Val Leu lie Pro Asp Asp Ser Thr Thr Leu Val Val Ala Gin 
1235 1240 1245 

Ala Val Leu Ala Gly Ala Leu Ala Leu Thr Gly Val Val Leu Leu Val 

1250 1255 1260 

Gly Ser Val Val Leu Asp Gly Leu Gin Glu Ala Asp 
60 1265 1270 1275 

(2) INFORMATION FOR SEQ ID NO: 4 02: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..1210 

(D) OTHER INFORMATION: / Ceres Seq. ID 2028731 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

Met Leu Asp Phe Ala Val Ser Asp Pro Lys Asn Met Asp Tyr Asp Glu 
15 10 ^ 15 

Leu Asp lie Glu Val Tyr Asn Asp Lys Arg Phe Gly Asn Gly Gly Gly 
20 25 30 

15 Arg Lys Asn His Phe Leu Gly Arg Val Lys lie Tyr Gly Ser Gin Phe 
35 40 " ~ 4 5 

Ser Arg Arg Gly Glu Glu Gly Leu Val Tyr Phe Pro Leu Glu Lys Lys 

50 55 60 

Ser Val Phe Ser Trp lie Arg Gly Glu lie Gly Leu Lys lie Tyr Tyr 
20 65 70 75 ~ 80 

Tyr Asp Glu Ala Ala Asp Glu Asp Thr Ala Gly Gly Gly Gly Gly Gin 

85 90 95 

Gin Gin Gin Gin Gin Gin Gin Gin Phe His Pro Pro Gin Gin Glu Ala 
100 105 110 

2 5 Asp Glu Gin Gin His Gin Gin Gin Phe His Pro Pro Pro Gin Gin Met 
115 120 125 

Met Asn lie Pro Pro Glu Lys Pro Asn Val Val Val Val Glu Glu Gly 

130 135 140 

Arg Val Phe Glu Ser Ala Gin Ser Gin Arg Tyr Thr Glu Thr His Gin 
30 145 150 155 160 

Gin Pro Pro Val Val lie Val Glu Glu Ser Pro Pro Gin His Val Met 

165 170 175 

Gin Gly Pro Asn Asp Asn His Pro His Arg Asn Asp Asn His Pro Gin 
180 185 190 

35 Arg Pro Pro Ser Pro Pro Pro Pro Pro Ser Ala Gly Glu Val His Tyr 
195 200 " 205 

Tyr Pro Pro Glu Val Arg Lys Met Gin Val Gly Arg Pro Pro Gly Gly 

210 215 220 

Asp Arg lie Arg Val Thr Lys Arg Pro Pro Asn Gly Asp Tyr Ser Pro 
40 225 230 235 " '* 240 

Arg Val lie Asn Ser Lys Thr Gly Gly Gly Glu Thr Thr Met Glu Lys 

245 250 255 

Lys Thr His His Pro Tyr Asn Leu Val Glu Pro Met Gin Tyr Leu Phe 
260 265 270 

4 5 Val Arg He Val Lys Ala Arg Gly Leu Pro Pro Asn Glu Ser Ala Tyr 

275 280 285 

Val Lys Val Arg Thr Ser Asn His Phe Val Arg Ser Lys Pro Ala Val 

290 295 300 

Asn Arg Pro Gly Glu Ser Val Asp Ser Pro Glu Trp Asn Gin Val Phe 
50 305 310 315 320 

Ala Leu Gly His Asn Arg Ser Asp Ser Ala Val Thr Gly Ala Thr Leu 

325 330 335 

Glu He Ser Ala Trp Asp Ala Ser Ser Glu Ser Phe Leu Gly Gly Val 
340 345 350 

5 5 Cys Phe Asp Leu Ser Glu Val Pro Val Arg Asp Pro Pro Asp Ser Pro 

355 360 365 

Leu Ala Pro Gin Trp Tyr Arg Leu Glu Gly Ser Gly Ala Asp Gin Asn 

370 375 380 

Ser Gly Arg He Ser Gly Asp He Gin Leu Ser Val Trp He Gly Thr 
60 385 ' 390 395 " 400 

Gin Val Asp Glu Ala Phe Pro Glu Ala Trp Ser Ser Asp Ala Pro His 
405 410 415 
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Val Ala His Thr Arg Ser Lys Val Tyr Gin Ser Pro Lys Leu Trp Tyr 

420 425 430 

Leu Arg Val Thr Val Leu Glu Ala Gin Asp Leu His lie Ala Pro Asn 
435 440 445 

5 Leu Pro Pro Leu Thr Ala Pro Glu lie Arg Val Lys Ala Gin Leu Gly 
450 455 460 

Phe Gin Ser Ala Arg Thr Arg Arg Gly Ser Met Asn Asn His Ser Gly 
465 470 475 480 

Ser Phe His Trp His Glu Asp Met lie Phe Val Ala Gly Glu Pro Leu 
10 485 490 ' 495 

Glu Asp Cys Leu Val Leu Met Val Glu Asp Arg Thr Thr Lys Glu Ala 

500 . 505 510 

Thr Leu Leu Gly His Ala Met lie Pro Val Ser Ser lie Glu Gin Arg 
515 ^ 520 525 

15 lie Asp Glu Arg Phe Val Pro Ser Lys Trp His Thr Leu Glu Gly Glu 
530 535 540 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Pro Gly Gly Gly Gly Gly Gly 
545 550 555 ' 560 

Gly Pro Tyr Cys Gly Arg lie Ser Leu Arg Leu Cys Leu Glu Gly Gly 
2 0 565 57 0 575 

Tyr His Val Leu Glu Glu Ala Ala His Val Cys Ser Asp Phe Arg Pro 

58 0 585 ^ ^ 590 

Thr Ala Lys Gin Leu Trp Lys Pro Pro lie Gly lie Leu Glu Leu Gly 
595 600 605 

2 5 He Leu Gly Ala Arg Gly Leu Leu Pro Met Lys Ala Lys Asn Gly Gly 
610 " 615 620 

Lys Gly Ser Thr Asp Ala. Tyr Cys Val Ala Lys Tyr Gly Lys Lys Trp 
625 630 635 640 

Val Arg Thr Arg Thr He Thr Asp Ser Phe Asp Pro Arg Trp His Glu 
30 645 650 ' 655 

Gin Tyr Thr Trp Gin Val Tyr Asp Pro Cys Thr Val Leu Thr Val Gly 

660 665 670 

Val Phe Asp Asn Trp Arg Met Phe Ser Asp Ala Ser Asp Asp Arg Pro 
675 680 685 

35 Asp Thr Arg He Gly Lys lie Arg He Arg Val Ser Thr Leu Glu Ser 
690 695 700 

Asn Lys Val Tyr Thr Asn Ser Tyr Pro Leu Leu Val Leu Leu Pro Ser 
705 ' 710 715 720 

Gly Met Lys Lys Met Gly Glu lie Glu Val Ala Val Arg Phe Ala Cys 
40 " 725 730 735 

Pro Ser Leu Leu Pro Asp Val Cys Ala Ala Tyr Gly Gin Pro Leu Leu 

740 745 750 

Pro Arg Met His Tyr He Arg Pro Leu Gly Val Ala Gin Gin Asp Ala 
755 760 765 

4 5 Leu Arg Gly Ala Ala Thr Lys Met Val Ala Ala Trp Leu Ala Arg Ala 
770 775 780 

Glu Pro Pro Leu Gly Pro Glu Val Val Arg Tyr Met Leu Asp Ala Asp 
785 790 795 800 

Ser His Ala Trp Ser Met Arg Lys Ser Lys Ala Asn Trp Tyr Arg He 
50 805 810 ~ 815 

Val Gly Val Leu Ala Trp Ala Val Gly Leu Ala Lys Trp Leu Asp Asn 

820 825 830 

He Arg Arg Trp Arg Asn Pro Val Thr Thr Val Leu Val His He Leu 
835 840 845 

55 Tyr Leu Val Leu Val Trp Tyr Pro Asp Leu Val Val Pro Thr Ala Phe 
850 855 860 

Leu Tyr Val Val Met He Gly Val Trp Tyr Tyr Arg Phe Arg Pro Lys 
865 870 875 880 

He Pro Ala Gly Met Asp He Arg Leu Ser Gin Ala Glu Thr Val Asp 
60 885 890 895 

Pro Asp Glu Leu Asp Glu Glu Phe Asp Thr He Pro Ser Ser Arg Arg 
900 905 910 
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Pro Glu Val lie Arg Ala Arg Tyr Asp Arg Leu Arg lie Leu Ala Val 

915 920 925 

Arg Val Gin Thr lie Leu Gly Asp Phe Ala Ala Gin Gly Glu Arg He 
930 935 940 

5 Gin Ala Leu Val Ser Trp Arg Asp Pro Arg Ala Thr Lys Leu Phe He 
945 950 955 960 

Ala He Cys Leu Val He Thr He Val Leu Tyr Ala Val Pro Ala Lys 

965 970 975 

Met Val Ala Val Ala Leu Gly Val Ser Asp Ser Val Pro Thr Ala Lys 
10 980 985 990 

Gin Asp Thr Lys Glu Ser Leu Lys Lys Ser Phe Ser Ser Leu Arg Phe 

995 ~ 1000 " 1005 

Asp Phe Ser Ser Met Ala Val Val Gly Ala Pro He Ser Ser Pro Ala 
1010 1015 1020 

15 Ala Gin Leu Gin Thr Gin Phe Leu Ser Asn Pro He Leu Pro Arg Phe 
1025 1030 1035 1040 

Arg Arg Ser Phe Ser Thr Gly Lys Ser Pro Ala Thr Phe Ser Val Val 

1045 " ' 1050 1055 

Ala Met Ala Pro Gin Lys Lys Val Asn Lys Tyr Asp Ala Lys Trp Lys 
20 1060 1065 1070 

Lys Gin Trp Tyr Gly Ala Gly Leu Phe Phe Glu Gly Ser Glu Gin He 

1075 ~ 1080 " 1085 

Asn Val Asp Val Phe Lys Lys Leu Glu Lys Arg Lys Val Leu Ser Asn 
1090 1095 1100 

25 Val Glu Lys Ser Gly Leu Leu Ser Lys Ala Glu Gly Leu Gly Leu Thr 
1105 ~ " 1110 ~ 1115 " " J 1120 

Leu Ser Ser Leu Glu Lys Leu Lys Val Phe Ser Lys Ala Glu Asp Leu 

1125 " 1130 1135 

Gly Leu Leu Ser Leu Leu Glu Asn Leu Ala Gly Thr Ser Pro Ala Val 
30 " 1140 1145 1150 

Leu Ala Ser Ala Ala Leu Pro Ala Leu Thr Ala Ala He Val Ala Val 

1155 1160 1165 

Val Leu He Pro Asp Asp Ser Thr Thr Leu Val Val Ala Gin Ala' Val 
1170 1175 1180 

35 Leu Ala Gly Ala Leu Ala Leu Thr Gly Val Val Leu Leu Val Gly Ser 
1185 " 1190 1195 ~ 1200 

Val Val Leu Asp Gly Leu Gin Glu Ala Asp 

1205 1210 
(2) INFORMATION FOR SEQ ID NO: 4 03: 
4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 1199 

(D) OTHER INFORMATION: / Ceres Seq. ID 2028732 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

Met Asp Tyr Asp Glu Leu Asp He Glu Val Tyr Asn Asp Lys Arg Phe 
1 5 10 15 

Gly Asn Gly Gly Gly Arg Lys Asn His Phe Leu Gly Arg Val Lys He 
20 25 30 

55 Tyr Gly Ser Gin Phe Ser Arg Arg Gly Glu Glu Gly Leu Val Tyr Phe 
35 4 0 ~ 45 

Pro Leu Glu Lys Lys Ser Val Phe Ser Trp He Arg Gly Glu He Gly 

50 ~ 55 60 

Leu Lys He Tyr Tyr Tyr Asp Glu Ala Ala Asp Glu Asp Thr Ala Gly 
60 65 "* 70 75 " 80 

Gly Gly Gly Gly Gin Gin Gin Gin Gin Gin Gin Gin Gin Phe His Pro 
85 90 95 
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Pro Gin Gin Glu Ala Asp Glu Gin Gin His Gin Gin Gin Phe His Pro 

100 105 110 

Pro Pro Gin Gin Met Met Asn lie Pro Pro Glu Lys Pro Asn Val Val 
115 120 125 

5 Val Val Glu Glu Gly Arg Val Phe Glu Ser Ala Gin Ser Gin Arg Tyr 
130 135 140 

Thr Glu Thr His Gin Gin Pro Pro Val Val lie Val Glu Glu Ser Pro 
145 150 155 160 

Pro Gin His Val Met Gin Gly Pro Asn Asp Asn His Pro His Arq Asn 
10 165 170 175 

Asp Asn His Pro Gin Arg Pro Pro Ser Pro Pro Pro Pro Pro Ser Ala 

180 185 190 

Gly Glu Val His Tyr Tyr Pro Pro Glu Val Arg Lys Met Gin Val Gly 
195 " " 200 205 

15 Arg Pro Pro Gly Gly Asp Arg lie Arg Val Thr Lys Arg Pro Pro Asn 
210 " ~ 215 220 

Gly Asp Tyr Ser Pro Arg Val lie Asn Ser Lys Thr Gly Gly Gly Glu 
225 230 235 ~ 240 

Thr Thr Met Glu Lys Lys Thr His His Pro Tyr Asn Leu Val Glu Pro 
20 245 250 255 

Met Gin Tyr Leu Phe Val Arg lie Val Lys Ala Arg Gly Leu Pro Pro 

260 265 270 

Asn Glu Ser Ala Tyr Val Lys Val Arg Thr Ser Asn His Phe Val Arg 
275 280 285 

25 Ser Lys Pro Ala Val Asn Arg Pro Gly Glu Ser Val Asp Ser Pro Glu 
290 295 300 

Trp Asn Gin Val Phe Ala Leu Gly His Asn Arg Ser Asp Ser Ala Val 
305 310 315 320 

Thr Gly Ala Thr Leu Glu lie Ser Ala Trp Asp Ala Ser Ser Glu Ser 
30 325 330 335 

Phe Leu Gly Gly Val Cys Phe Asp Leu Ser Glu Val Pro Val Arg Asp 

340 345 350 

Pro Pro Asp Ser Pro Leu Ala Fro Gin Trp Tyr Arg Leu Glu Gly Ser 
355 360 365 

35 Gly Ala Asp Gin Asn Ser Gly Arg lie Ser Gly Asp lie Gin Leu Ser 
370 375 380 

Val Trp lie Gly Thr Gin Val Asp Glu Ala Phe Pro Glu Ala Trp Ser 
385 ^ 390 395 400 

Ser Asp Ala Pro His Val Ala His Thr Arg Ser Lys Val Tyr Gin Ser 
40 405 410 415 

Pro Lys Leu Trp Tyr Leu Arg Val Thr Val Leu Glu Ala Gin Asp Leu 

420 " 425 430 

His lie Ala Pro Asn Leu Pro Pro Leu Thr Ala Pro Glu lie Arg Val 
435 440 445 

4 5 Lys Ala Gin Leu Gly Phe Gin Ser Ala Arg Thr Arg Arg Gly Ser Met 
450 455 460 

Asn Asn His Ser Gly Ser Phe His Trp His Glu Asp Met lie Phe Val 
465 470 475 480 

Ala Gly Glu Pro Leu Glu Asp Cys Leu Val Leu Met Val Glu Asp Arg 
50 485 490 495 

Thr Thr Lys Glu Ala Thr Leu Leu Gly His Ala Met lie Pro Val Ser 

500 505 510 

Ser lie Glu Gin Arg lie Asp Glu Arg Phe Val Pro Ser Lys Trp His 
515 520 525 

55 Thr Leu Glu Gly Glu Gly Gly Gly Gly Gly Gly Gly Gly Gly Pro Gly 
530 535 540 

Gly Gly Gly Gly Gly Gly Pro Tyr Cys Gly Arg lie Ser Leu Arg Leu 
545 " " 550 * 555 560 

Cys Leu Glu Gly Gly Tyr His Val Leu Glu Glu Ala Ala His Val Cys 
60 - " 565 570 575 

Ser Asp Phe Arg Pro Thr Ala Lys Gin Leu Trp Lys Pro Pro lie Gly 
580 ~ 585 590 
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He Leu Glu Leu Gly He Leu Gly Ala Arg Gly Leu Leu Pro Met Lys 

595 600 605 

Ala Lys Asn Gly Gly Lys Gly Ser Thr Asp Ala Tyr Cys Val Ala Lys 
610 615 620 

5 Tyr Gly Lys Lys Trp Val Arg Thr Arg Thr He Thr Asp Ser Phe Asp 
625 630 635 640 

Pro Arg Trp His Glu Gin Tyr Thr Trp Gin Val Tyr Asp Pro Cys Thr 

645 650 655 

Val Leu Thr Val Gly Val Phe Asp Asn Trp Arg Met Phe Ser Asp Ala 
10 660 665 670 

Ser Asp Asp Arg Pro Asp Thr Arg He Gly Lys He Arg He Arg Val 

675 680 685 

Ser Thr Leu Glu Ser Asn Lys Val Tyr Thr Asn Ser Tyr Pro Leu Leu 
690 695 700 

15 Val Leu Leu Pro Ser Gly Met Lys Lys Met Gly Glu He Glu Val Ala 
705 710 715 720 

Val Arg Phe Ala Cys Pro Ser Leu Leu Pro Asp Val Cys Ala Ala Tyr 

725 730 735 

Gly Gin Pro Leu Leu Pro Arg Met His Tyr He Arg Pro Leu Gly Val 
20 740 745 750 

Ala Gin Gin Asp Ala Leu Arg Gly Ala Ala Thr Lys Met Val Ala Ala 

755 760 765 

Trp Leu Ala Arg Ala Glu Pro Pro Leu Gly Pro Glu Val Val Arg Tyr 
770 775 780 

2 5 Met Leu Asp Ala Asp Ser His Ala Trp Ser Met Arg Lys Ser Lys Ala 

785 790 795 " 800 

Asn Trp Tyr Arg He Val Gly Val Leu Ala Trp Ala Val Gly Leu Ala 

805 810 815 

Lys Trp Leu Asp Asn lie Arg Arg Trp Arg Asn Pro Val Thr Thr Val 
30 ^ 820 825 830 

Leu Val His He Leu Tyr Leu Val Leu Val Trp Tyr Pro Asp Leu Val 

835 " 840 845 

Val Pro Thr Ala Phe Leu Tyr Val Val Met lie Gly Val Trp Tyr Tyr 
850 855 860 

3 5 Arg Phe Arg Pro Lys He Pro Ala Gly Met Asp He Arg Leu Ser Gin 

865 ' 870 875 880 

Ala Glu Thr Val Asp Pro Asp Glu Leu Asp Glu Glu Phe Asp Thr He 

885 890 895 

Pro Ser Ser Arg Arg Pro Glu Val He Arg Ala Arg Tyr Asp Arg Leu 

4 0 900 905 910 

Arg He Leu Ala Val Arg Val Gin Thr He Leu Gly Asp Phe Ala Ala 

915 920 925 

Gin Gly Glu Arg He Gin Ala Leu Val Ser Trp Arg Asp Pro Arg Ala 
930 935 940 

4 5 Thr Lys Leu Phe He Ala He Cys Leu Val He Thr He Val Leu Tyr 
945 950 955 960 

Ala Val Pro Ala Lys Met Val Ala Val Ala Leu Gly Val Ser Asp Ser 

965 970 975 

Val Pro Thr Ala Lys Gin Asp Thr Lys Glu Ser Leu Lys Lys Ser Phe 
50 980 " 985 990 

Ser Ser Leu Arg Phe Asp Phe Ser Ser Met Ala Val Val Gly Ala Pro 

995 1000 1005 

He Ser Ser Pro Ala Ala Gin Leu Gin Thr Gin Phe Leu Ser Asn Pro 
1010 1015 1020 

55 He Leu Pro Arg Phe Arg Arg Ser Phe Ser Thr Gly Lys Ser Pro Ala 
1025 1030 1035 1040 

Thr Phe Ser Val Val Ala Met Ala Pro Gin Lys Lys Val Asn Lys Tyr 

1045 1050 " 1055 

Asp Ala Lys Trp Lys Lys Gin Trp Tyr Gly Ala Gly Leu Phe Phe Glu 
60 ^ 1060 " 1065 1070 

Gly Ser Glu Gin He Asn Val Asp Val Phe Lys Lys Leu Glu Lys Arg 
1075 1080 ' 1085 
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(2) INFORMATION FOR SEQ ID NO: 40 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

{A) NAME/ KEY : peptide 
(B) LOCATION: 1 . . 10 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029081 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 



Met 


Ser 


Val 


Ser 


Ala 


lie 


Phe 


Gly 


Thr 


Gly 


He 


Val 


Thr 


Val 


Ala 


Ala 










5 










10 










15 




Ser 


Pro 


v a 1 


Leu 

20 


Arg 


Gin 


Phe 


Gin 


Val 
25 


Pro 


Lys 


Leu 


Gly 


Asn 

30 


Gly 


Gly 


Gl y 


Leu 


Gly 
35" 


Met 


Val 


He 


Glu 


Cys 
40 


Ser 


Ser 


Arg 


Pro 


Gin 
45 


Lys 


Lys 


Ser 


Thr 


Ala 

50 


His 


His 


Arg 


Lys 


Thr 

55 


Arg 


Pro 


Lys 


Lys 


Thr 

60 


Gin 


Pro 


Trp 


Asp 


lie 


LtVS 


Arg 


Lys 


Pro 


Thr 


Val 


Tyr 


Ala 


Pro 


Leu 


Pro 


Pro 




Pro 


Al- 


65 










7 0 










7 5 










SO 


Glu 


Trp 


Ser 


Pro 


Phe 
85 


Thr 


Leu 


Ala 


Ser 


Asn 
90 


Asp 


Gly 


Gly 


Ala 


Ala 
95 


Val 


Ala 


Ala 


Ser 


Pro 


Ala 


Gly 


Asp 


Leu 


Val 


Ser 


Gly 


Ser 


Ala 









100 105 
(2) INFORMATION FOR SEQ ID NO: 4 07: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 
;C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 74 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029082 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
Met Val He Glu Cys Ser Ser Arg Pro Gin Lys Lys Ser Thr Ala His 
15 10 15 

His Arg Lys Thr Arg Pro Lys Lys Thr Gin Pro Trp Asp He Lys Arg 

20 25 30 

Lys Pro Thr Val Tyr Ala Pro Leu Pro Pro Leu Pro Ala Glu Trp Ser 

35 4 0 4 5 

Pro Phe Thr Leu Ala Ser Asn Asp Gly Gly Ala Ala Val Ala Ala Ser 

50 55 60 

Pro Ala Gly Asp Leu Val Ser Gly Ser Ala 
65 " ~ 7 0 

(2) INFORMATION FOR SEQ ID NO: 4 08: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..7 87 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029225 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 
agggattcta gggctttcct tcattggtca gatcagacga cgttttacat cttcttcttc 60 
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ttcctcttcg atattcgtca gtgtgtgtat tttggggaaa actttgtgag caaagagcga 120 

gaaaatgagc ggagccggta agaaaatcgc ggatgtggct ttcaaagctt caaggactat 180 

cgattgggat ggtatggcta aggtccttgt cacagatgag gctcgtagag agttctctaa 240 

ccttcgtcgt gctttcgatg aggttaacac acagctccag accaaattta gtcaggaacc 300 

tgaacctata gattgggatt actataggaa gggtattgga gctggcattg ttgacaagta 360 

caaggaagct tatgacagca ttgagattcc aaagtacgtt gacaaagtta ctcctgaata 420 

caagccaaag tttgatgctt tgttggtgga actgaaagaa gcagaacaga aatcgctcaa 480 

ggagtctgaa cggttggaga aagaaattgc tgatgtccaa gagatcagca aaaagctcag 540 

caccatgact gcagatgagt actttgagaa gcacccggaa ctcaaaaaga agtttgatga 600 

cgaaatccgt aatgacaact ggggatactg atcatgtttc tccatctccg gcttggaaag 660 

aaaactctct ttctctttct ctgttctctt actgtgattt tgtgagccaa tcataacaat 720 

aataagtaca ccattcacti aagcagtgtt gagatcttca ttccaaggaa gataaacgca 780 



tttggtt 

(2) INFORMATION FOR SEQ ID NO : 4 0 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

{ A ) NAME /KEY: peptide 
(B) LOCATION: 1. .168 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029226 





(xi) 


SEQUENCE DESCRIPTION: SEQ 


I D NO: 4 09 : 










Met 


Ser 


Gly 


Ala 


Gly 


Lys 


Lys 


He 


Ala 


Asp 


Val 


Ala 


Phe 


Lys 


Ala 


Ser 


1 








5 










10 










15 




Arg 


Thr 


He 


Asp 


Trp 


Asp 


Gly 


Met 


Ala 


Lys 


Val 


Leu 


Val 


Thr 


Asp 


Glu 








20 










25 










30 






Ala 


Arg 


Arg 


Glu 


Phe 


Ser 


Asn 


Leu 


Arg 


Arg 


Ala 


Phe 


Asp 


Glu 


Val 


Asn 






35 










40 










45 








Thr 


Gin 


Leu 


Gin 


Thr 


Lys 


Phe 


Ser 


Gin 


Glu 


Pro 


Glu 


Pro 


He 


Asp 


Trp 




50 










55 










60 










Asp 


Tyr 


Tyr 


Arg 


Lys 


Gly 


He 


Gly 


Ala 


Gly 


He 


Val 


Asp 


Lys 


Tyr 


Lys 


65 










70 










75 










80 


Glu 


Ala 


Tyr 


A.sp 


Ser 


He 


Glu 


He 


Pro 


Lys 


Tyr 


Val 


Asp 


Lys 


Val 


Thr 










85 










90 










95 




Pro 


Glu 


Tyr 


Lys 


Pro 


Lys 


Phe 


A.sp 


Ala 


Leu 


Leu 


Val 


Glu 


Leu 


Lys 


Glu 








100 










105 










110 






Ala 


Glu 


Gin 


Lys 


Ser 


Leu 


Lys 


Glu 


Ser 


Glu 


Arg 


Leu 


Glu 


Lys 


Glu 


He 






115 










120 










125 








Ala 


Asp 


Val 


Gin 


Glu 


He 


Ser 


Lys 


Lys 


Leu 


Ser 


Thr 


Met 


Thr 


Ala 


A.sp 




130 










135 










140 










Glu 


Tyr 


Phe 


Glu 


Lys 


His 


Pro 


Glu 


Leu 


Lys 


Lys 


Lys 


Phe 


Asp 


Asp 


Glu 


145 










150 










155 










160 


He 


Arg 


Asn 


Asp 


Asn 


Trp 


Gly 


Tyr 



















165 



(2) INFORMATION FOR SEQ ID NO : 4 1 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .14 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029227 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 
Met Ala Lys Val Leu Val Thr Asp Glu Ala Arg Arg Glu Phe Ser Asn 
15 10 15 

Leu Arg Arg Ala Phe Asp Glu Val Asn Thr Gin Leu Gin Thr Lys Phe 
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20 25 30 



Ser 


Gin 


Glu 


Pro 


Glu 


Pro 


He 


Asp 


Trp 


Asp 


Tyr 


Tyr 


Ara 


Lys 


Gly 


He 






35 










40 










45 








Gly 


Ala 


Gly 


He 


Val 


Asp 


Lys 


Tyr 


Lys 


Glu 


Ala 


Tyr 


Asp 


Ser 


He 


Glu 




50 










55 










60 










He 


Pro 


Lys 


Tyr 


Val 


Asp 


Lys 


Val 


Thr 


Pro 


Glu 


Tyr 


Lys 


Pro 


Lys 


Phe 


65 










70 










75 










80 


Asp 


Ala 


Leu 


Leu 


Val 


Glu 


Leu 


Lys 


Glu 


Ala 


Glu 


Gin 


Lys 


Ser 


Leu 


Lys 










85 










90 










95 




Glu 


Ser 


Glu 


Arg 


Leu 


Glu 


Lys 


Glu 


He 


Ala 


Asp 


Val 


Gin 


Glu 


He 


Ser 








100 










105 










110 






Lys 


Lys 


Leu 


Ser 


Thr 


Met 


Thr 


Ala 


Asp 


Glu 


Tyr 


Phe 


Glu 


Lys 


His 


Pro 






115 










120 










125 








Glu 


Leu 


Lys 


Lys 


Lys 


Phe 


Asp 


Asp 


Glu 


He 


Arg 


Asn 


Asp 


Asn 


Trp 


Gly 




130 










135 










140 










Tyr 
































145 
































(2) 


INFORMATION 


FOR 


SEQ 


id : 


NO : 4 1 1 : 
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(1) SEQUENCE CHARACTERISTICS: 



20 (A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) 3TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
2 5 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .15 87 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029422 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

30 gtagtctctc tttttataac cacttctcga aaactgaaac ctttgtagag 
gttcgataaa acattctttt tgcaactgag acttggcaac ttggttttac 
atttcttctt gttgatggtg ataagttcaa tatgatgaaa agtttggcra 
agggaagacg gcgagggcat gtgatagctg cgtgaagagg cgggcacgtt 
agctgatgat gcctttcttt gccatgcttg tgacggttcg gtccactcgg 

35 tgctcgtagg cacgagagag ttcgcttgaa atcggctagc gccggaaagt 
ctcgccgcct caccaagcca cgtggcatca gggatttaca cgtaaagctc 
tggaggcaag aagagccaca cgatggtttt tcatgatctt gtgccggaga 
ggatcaagcg gagagttacg aggtggaaga qcagctcata t^tgaggrgc 
ctcgatggtt gaggagcaat gctttaacca atccctggag aaacagaatg 

4 0 gatgccctta agtttcaaga gtagtgacga agaagatgac gacaacqctg 
gaatggtttg ttcccaaccg acatggaact agctcagttc acagctgacg 
actcggtgga ggggatcgag agtttcattc catagaagaa ctagggttag 
aaagatcgaa aaagaggagg tggaggaaga gggagttgtg acaagagaag 
agatgaaggt gatgagacat ccccatttga aataagcttt gactacgagt 

4 5 gaccacattc gatgaaggag aagaagatga gaaagaagac gtgatgaaga 
gatgggagtg aatgagatga gtggtgggat taaagaagag aagaaggaga 
gcttagattg gactatgaat cagtcatttc cacttgggga ggccaaggga 
cgcccgggtg ccatctgaaa tagacctcga catggtttgt ttcccaaccc 
tgaaagtgga gcagaggctc atcatcacaa ccacttccgc ggcctagggt 

50 agatgctggg gatggaggaa gagaggctag ggtttcaaga taccgagaga 
aaggttgttc tccaagaaga taaggtacga ggtacgtaaa rtgaatgcag 
tcgcatgaaa ggaaggttcg tcaagagatc ttcaattggt gttgctcact 
attaattatg gatattaaat tactttgctc tcatcttgct tttttgttgc 
gtgattgtta gctttctttt tctgcattca tagagaattt tgcacgtttt 

55 gtatgtacat aaatatatca ccaaaaaatg tgactatctt graagcacrg 
tcgatataac gtgaattttg attgctg 
(2) INFORMATION FOR SEQ ID NO: 412: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 amino acids 

60 <B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



agaacccat a 


60 


tcaaagtaag 


120 


gtgcggttgg 


180 


ggtattgcgc 


240 


caaaccctct 


300 


at cgccatgc 


360 


ggaccccacg 


420 


rgagcacgga 


480 


cggtgar gaa 


540 


aguttccaat 


600 


agagttgtct 


660 


tggagact ct 


720 


gtgagatgtt 


780 


tgcatgatca 


840 


acacacacaa 


900 


atgtgatgga 


960 


aggctcttat 


1020 


tcccatggac 


1080 


ataccatggg 


1140 


tacacctagg 


1200 


aaaggaggac 


1260 


agaaaaggcc 


1320 


aaagaactta 


1380 


tatagttttg 


1440 


tgtgagctac 


1500 


atttatatag 


1560 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .4 06 

5 (D) OTHER INFORMATION: / Ceres Seq. ID 2029423 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 
Met Met Lys Ser Leu Ala Ser Ala Val Gly Gly Lys Thr Ala Arg Ala 
1 5 1(T 15 

Cys Asp Ser Cys Val Lys Arg Arg Ala Arg Trp Tyr Cys Ala Ala Asp 
10 20 25 30 

Asp Ala Phe Leu Cys His Ala Cys Asp Gly Ser Val His Ser Ala Asn 

35 ~' 4 0 ^ 4 5 

Pro Leu Ala Arg Arg His Glu Arg Val Arg Leu Lys Ser Ala Ser Ala 
5 0 5 5 60 

15 Gly Lys Tyr Arg His Ala Ser Pro Pro His Gin Ala Thr Trp His Gin 
65" 70 75 80 

Gly Phe Thr Arg Lys Ala Arg Thr Pro Arg Gly Gly Lys Lys Ser His 

85 90 95 

Thr Met Val Phe His Asp Leu Val Pro Glu Met Ser Thr Glu Asp Gin 
20 100 105 110 

Ala Glu Ser Tyr Glu Val Glu Glu Gin Leu He Phe Glu Val Pro Val 

115 120 125 

Met Asn Ser Met Val Glu Glu Gin Cys Phe Asn Gin Ser Leu Glu Lys 
130 135 140 

25 Gin Asn Glu Phe Pro Met Met Pro Leu Ser Phe Lys Ser Ser Asp Glu 
145 150 155 ~ 160 

Glu Asp Asp Asp Asn Ala Glu Ser Cys Leu Asn Gly Leu Phe Pro Thr 

165 170 175 

Asp Met Glu Leu Ala Gin Phe Thr Ala Asp Val Glu Thr Leu Leu Gly 
30 180 185 ~ 190 

Gly Gly Asp Arg Glu Phe His Ser He Glu Glu Leu Gly Leu Gly Glu 

195 200 205 

Met Leu Lys He Glu Lys Glu Glu Val Glu Glu Glu Gly Val Val Thr 
210 ' 215 220 

35 Arg Glu Val His Asp Gin Asp Glu Gly Asp Glu Thr Ser Pro Phe Glu 
225 230 235 240 

He Ser Phe Asp Tyr Glu Tyr Thr His Lys Thr Thr Phe Asp Glu Gly 

245 ' 250 255 

Glu Glu Asp Glu Lys Glu Asp Val Met Lys Asn Val Met Glu Met Gly 
40 260 265 270 

Val Asn Glu Met Ser Gly Gly He Lys Glu Glu Lys Lys Glu Lys Ala 

275 ^ 280 285 

Leu Met Leu Arg Leu Asp Tyr Glu Ser Val He Ser Thr Trp Gly Gly 
290 295 300 

4 5 Gin Gly He Pro Trp Thr Ala Arg Val Pro Ser Glu He Asp Leu Asp 
305 " ~ 310 315 320 

Met Val Cys Phe Pro Thr His Thr Met Gly Glu Ser Gly Ala Glu Ala 

325 330 335 

His His His Asn His Phe Arg Gly Leu Gly Leu His Leu Glv Asp Ala 
50 340 345 350 

Gly Asp Gly Gly Arg Glu Ala Arg Val Ser Arg Tyr Arg Glu Lys Arg 

355 360 365 

Arg Thr Arg Leu Phe Ser Lys Lys He Arg Tyr Glu Val Arg Lys Leu 
370 375 380 

55 Asn Ala Glu Lys Arg Pro Arg Met Lys Gly Arg Phe Val Lys Arg Ser 
385 390 395 400 

Ser He Gly Val Ala His 
405 

(2) INFORMATION FOR SEQ ID NO: 4 13: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 05 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

5 (A) NAME/KEY: peptide 

(B) LOCATION: 1. .4 05 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029424 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 
Met Lvs Ser Leu Ala Ser Ala Val Gly Gly Lys Thr Ala Arg Ala Cys 
10 1 5 10 15 

Asp Ser Cys Val Lys Arg Arg Ala Arg Trp Tyr Cys Ala Ala Asp Asp 

20 25 30 

Ala Phe Leu Cys His Ala Cys Asp Gly Ser Val His Ser Ala Asn Pro 
35 40 45 

15 Leu Ala Arg Arg His Glu Arg Val Arg Leu Lys Ser Ala Ser Ala Gly 
50 55 60 

Lys Tyr Arg His Ala Ser Pro Pro His Gin Ala Thr Trp His Gin Gly 
65 70 75 80~ 

Phe Thr Arg Lys Ala Arg Thr Pro Arg Gly Gly Lys Lys Ser His Thr 
20 85 90 95 

Met Val Phe His Asp Leu Val Pro Glu Met Ser Thr Glu Asp Gin Ala 

100 105 lib 

Glu Ser Tyr Glu Val Glu Glu Gin Leu He Phe Glu Val Pro Val Met 
115 120 125 

2 5 Asn Ser Met Val Glu Glu Gin Cys Phe Asn Gin Ser Leu Glu Lys Gin 

130 135 140 

Asn Glu Phe Pro Met Met Pro Leu Ser Phe Lys Ser Ser Asp Glu Glu 
145 150 155 160 

Asp Asp Asp Asn Ala Glu Ser Cys Leu Asn Gly Leu Phe Pro Thr Asp 
30 165 170 175 

Met Glu Leu Ala Gin Phe Thr Ala Asp Val Glu Thr Leu Leu Gly Gly 

180 185 190 

Gly Asp Arg Glu Phe His Ser He Glu Glu Leu Gly Leu Gly Glu Met 
195 200 205 

3 5 Leu Lys He Glu Lys Glu Glu Val Glu Glu Glu Gly Val Val Thr Arg 

210 215 220 

Glu Val His Asp Gin Asp Glu Gly Asp Glu Thr Ser Pro Phe Glu He 
225 230 235 240 

Ser Phe Asp Tyr Glu Tyr Thr His Lys Thr Thr Phe Asp Glu Gly Glu 
40 245 250 255 

Glu Asp Glu Lys Glu Asp Val Met Lys Asn Val Met Glu Met Gly Val 

260 265 270 

Asn Glu Met Ser Gly Gly He Lys Glu Glu Lys Lys Glu Lys Ala Leu 
275 280 285 

4 5 Met Leu Arg Leu Asp Tyr Glu Ser Val He Ser Thr Trp Gly Gly Gin 

290 295 300 

Gly He Pro Trp Thr Ala Arg Val Pro Ser Glu lie Asp Leu Asp Met 
305 310 315 320 

Val Cys Phe Pro Thr His Thr Met Gly Glu Ser Gly Ala Glu Ala His 
50 325 330 335 

His His Asn His Phe Arg Gly Leu Gly Leu His Leu Gly Asp Ala Gly 

34 0 34 5 35 0 

Asp Gly Gly Arg Glu Ala Arg Val Ser Arg Tyr Arg Glu Lys Arg Arg 
355 360 365 

55 Thr Arg Leu Phe Ser Lys Lys lie Arg Tyr Glu Val Arg Lys Leu Asn 
370 375 380 

Ala Glu Lys Arg Pro Arg Met Lys Gly Arg Phe Val Lys Arg Ser Ser 
385 390 395 400 

lie Gly Val Ala His 
60 405 

(2) INFORMATION FOR SEQ ID NO: 4 14: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..309 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029425 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 414: 

Met Val Phe His Asp Leu Val Pro Glu Met Ser Thr Glu Asp Gin Ala 
15 10 15 

Glu Ser Tyr Glu Val Glu Glu Gin Leu lie Phe Glu Val Pro Val Met 
20 25 30 

15 Asn Ser Met Val Glu Glu Gin Cys Phe Asn Gin Ser Leu Glu Lys Gin 
35 40 45 

Asn Glu Phe Pro Met Met Pro Leu Ser Phe Lys Ser Ser Asp Glu Glu 

50 55 60 

Asp Asp Asp Asn Ala Glu Ser Cys Leu Asn Gly Leu ?he Pro Thr Asp 
20 65 ~ 70 75 60 

Met Glu Leu Ala Gin Phe Thr Ala Asp Val Glu Thr Leu Leu Gly Gly 

85 90 95 

Gly Asp Arg Glu Phe His Ser lie Glu Glu Leu Gly Leu Gly Glu Met 
100 105 110 

25 Leu Lys lie Glu Lys Glu Glu Val Glu Glu Glu Gly Val Val Thr Arg 
115 120 125 

Glu Val His Asp Gin Asp Glu Gly Asp Glu Thr Ser Fro Phe Glu lie 

130 L 135 140 

Ser Phe Asp Tyr Glu Tyr Thr His Lys Thr Thr Phe Asp Glu Gly Glu 
30 145 150 155 160 

Glu Asp Glu Lys Glu Asp Val Met Lys Asn Val Met Glu Met Gly Val 

165 170 175 

Asn Glu Met Ser Gly Gly lie Lys Glu Glu Lys Lys Glu Lys Ala Leu 
180 185 190 

35 Met Leu Arg Leu Asp Tyr Glu Ser Val lie Ser Thr Trp Gly Gly Gin 
195 200 205 

Gly lie Pro Trp Thr Ala Arg Val Pro Ser Glu lie Asp Leu Asp Met 

210 215 220 

Val Cys Phe Pro Thr His Thr Met Gly Glu Ser Gly Ala Glu Ala His 
40 225 230 235 240 

His His Asn His Phe Arg Gly Leu Gly Leu His Leu Gly Asp Ala Gly 

245 250 255 

Asp Gly Gly Arg Glu Ala Arg Val Ser Arg Tyr Arg Glu Lys Arg Arg 
260 265 270 

4 5 Thr Arg Leu Phe Ser Lys Lys lie Arg Tyr Glu Val Arg Lys Leu Asn 
275 280 285 

Ala Glu Lys Arg Pro Arg Met Lys Gly Arg Phe Val Lys Arg Ser Ser 

290 295 300 

lie Gly Val Ala His 
50 305 

(2) INFORMATION FOR SEQ ID NO : 4 1 5 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
60 (B) LOCATION: 1..1300 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029806 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 
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crcaagtttc ttgcaattac ctacttacaa caaagcaact cgactcgaac aaacacatcc 60 

atggctcctt ctgcgcaacc tcttcctgtg agtgtttcgg atgaaaaata tgcgaatgtc 120 

aagtgggaag agttggcatt caagtttgtt cgtacggatt atatgtatgt tgcgaagtgc 180 

aatcatggag agagttttca agaggggaag attcttcctt ttgctgattt gcaacttaac 240 

ccttgcgctg ctgttcttca gtatggccag ggtttatatg aaggactgaa agcttacagg 300 

acagaagatg gtcggattct gctattccga ccagaccaaa acggtctccg ccttcaagcc 360 

ggagctgaca gactctatat gccttatccr tcggtcgatc aattcgtctc cgccatcaaa 420 

caagttgctc ttgccaacaa gaaatggart cctcctccgg ggaaaggaac attgtatatt 480 

aggcctatct tgtttgggag tggtccgart cttggttcat ttcccattcc tgagaccacc 540 

ttcacagctt ttgcctgtcc tgttggacgt tatcataagg ataactctgg tttgaatctg 600 

aaaatcgaag atcagtttcg tcgagctt" cctagtggaa crggtggtgt gaagagcatc 660 

acaaactatt gtcctgtttg gataccattg gcagaggcga aaaaacaagg tttctctgat 720 

attttgtttt tggatgctgc aactggcaaa aacatrgaag aacttttcgc agctaatgtt 780 

tttatgctca agggcaatgt tgtatcgaca ccaacaattg caggaactat tttgcccgga 840 

gtcactcgaa actgcgtaat ggaattgrgt cgtgatttcg gctaccaggt cgaggaacgt 900 

acgattcctc tagtggactt tctcgatgcg gacgaagctt retgtactgg cactgcLtcc 960 

attgtgacta gtattgcatc cgtaaccut- aaagacaaaa agaccggatt caaaacaggg 1020 

gaagaaacat tggctgcgaa gctatacgag acgttaagrg aratccagac gggtcgggtc 1080 

gaggatacca agggatggac ggtggaga" gaccgccagg gctgaaagtt gaaactgtaa 1140 

cttgatgcta aatatgtgtg tgtgtgtata caaaacttat gtaagaaaca tctgaagatg 1200 

tctcrgatct ttgtgattgt gatgatcarg catgtcctat cgttgcgacg tartttataa 1260 



atgttcgtct gtaagttatt taataacra: ggcttetitgc 
(2} INFORMATION FOR SEQ ID NO : 4 1 6 : 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amine acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 374 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029807 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 6 : 



Leu 


Lys 


Phe 


Leu 


Ala 


He 


Thr 


Tyr 


Leu 


Gin 


Gin 


Ser 


Asn 


Ser 


Thr 


Arg 


1 








5 










10 










15 




Thr 


Asn 


Thr 


Ser 
20 


Met 


Ala 


Pro 


Ser 


Ala 

25 


Gin 


Pro 


Leu 


Pro 


Val 

30 


Ser 


Val 


Ser 


Asp 


Glu 
35 


Lys 


Tyr 


Ala 


Asn 


4 0 


4_i y s 


Trp 


Glu 


Glu 


Leu 
45 


Ala 


Phe 


Lys 


Phe 


Val 
50 


Arg 


Thr 


Asp 


Tyr 


Met 
55 


Tyr 


Val 


Ala 


Lys 


Cys 
60 


Asn 


Kis 


Gly 


Glu 


Ser 


Phe 


Gin 


Glu 


Gly 


Lys 


He 


Leu 


Pre 


Phe 


Ala 


Asp 


Leu 


Gin 


Leu 


Asn 


65 










70 










75 










80 


Pro 


Cys 


Ala 


Ala 


Val 
85 


Leu 


Gin 


Tyr 


Gly 


Gin 
90 


Gly 


Leu 


Tyr 


Glu 


Gly 
95 


Leu 


Lys 


Ala 


Tyr 


Arg 
100 


Thr 


Glu 


Asp 


Gly 


Arg 

105 


He 


Leu 


Leu 


Phe 


Arg 
110 


Pro 


Asp 


Gin 


Asn 


Gly 
115 


Leu 


Arg 


Leu 


Gin 


Ala 
120 


Gly 


Ala 


Asp 


Arg 


Leu 
125 


Tyr 


Met 


Pro 


Tyr 


Pro 
130 


Ser 


Val 


Asp 


Gin 


Phe 
135 


Val 


Ser 


Ala 


He 


Lys 
140 


Gin 


Val 


Ala 


Leu 


Ala 


Asn 


Lys 


Lys 


Trp 


He 


Pro 


Pro 


Pro 


Gly 


Lys 


Gly 


Thr 


Leu 


Tyr 


He 


145 










150 










155 










160 


Arg 


Pro 


He 


Leu 


Phe 
165 


Gly 


Ser 


Gly 


Pro 


He 
170 


Leu 


Gly 


Ser 


Phe 


Pro 
175 


He 


Pro 


Glu 


Thr 


Thr 
180 


Phe 


Thr 


Ala 


Phe 


Ala 
185 


Cys 


Pro 


Val 


Gly 


Arg 
190 


Tyr 


His 


Lys 


Asp 


Asn 
195 


Ser 


Gly 


Leu 


Asn 


Leu 
200 


Lys 


He 


Glu 


Asp 


Gin 
205 


Phe 


Arg 


Arg 


Ala 


Phe 
210 


Pro 


Ser 


Gly 


Thr 


Gly 
215 


Gly 


Val 


Lys 


Ser 


He 
220 


Thr 


Asn 


Tyr 


Cys 
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Pro Val Trp lie Pro Leu Ala Glu Ala Lys Lys Gin Gly Phe Ser Asp 
225 230 235 240 

lie Leu Phe Leu Asp Ala Ala Thr Gly Lys Asn lie Glu Glu Leu Phe 
245 250 255 

5 Ala Ala Asn Val Phe Met Leu Lys Gly Asn Val Val Ser Thr Pro Thr 
260 265 270 

lie Ala Gly Thr lie Leu Pro Gly Val Thr Arg Asn Cys Val Met Glu 

275 280 285 

Leu Cys Arg Asp Phe Gly Tyr Gin Val Glu Glu Arg Thr lie Pro Leu 
10 290 " 295 300 

Val Asd Phe Leu Asp Ala Asp Glu Ala Phe Cys Thr Gly Thr Ala Ser 
305 310 315 320 

lie Val Thr Ser lie Ala Ser Val Thr Phe Lys Asp Lys Lys Thr Gly 
325 330 ^ ^ 335 

15 Phe Lys Thr Gly Glu Glu Thr Leu Ala Ala Lys Leu Tyr Glu Thr Leu 

340 345 350 

Ser As p lie Gin Thr Gly Arg Val Glu Asp Thr Lys Gly Trp Thr Val 

355 360 365 

Glu lie Asp Arg Gin Gly 
20 370 

(2) INFORMATION FOR SEQ ID NO: 417: 
i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 amino acids 

(B) TYPE: amino acid 
2 5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
30 (B) LOCATION: 1..354 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029808 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 
Met Ala Pro Ser Ala Gin Pro Leu Pro Val Ser Val Ser Asp Glu Lys 
15 10 15 

35 Tyr Ala Asn Val Lys Trp Glu Glu Leu Ala Phe Lys Phe Val Arg Thr 

20 25 30 

Asp Tyr Met Tyr Val Ala Lys Cys Asn His Gly Glu Ser Phe Gin Glu 

* 35 4 0 4 5 

Gly Lys He Leu Pro Phe Ala Asp Leu Gin Leu Asn Pro Cvs Ala Ala 
40 50 55 60 

Val Leu Gin Tyr Gly Gin Gly Leu Tyr Glu Gly Leu Lys Ala Tyr Arg 
65 7 0 ~ 7 5 80 

Thr Glu Asp Gly Arg He Leu Leu Phe Arg Pro Asp Gin Asn Gly Leu 
85 90 95 

4 5 Arg Leu Gin Ala Gly Ala Asp Arg Leu Tyr Met Pro Tyr Pro Ser Val 

100 105 110 

Asp Gin Phe Val Ser Ala He Lys Gin Val Ala Leu Ala Asn Lys Lys 

115 120 125 

Trp He Pro Pro Pro Gly Lys Gly Thr Leu Tyr He Arg Pro He Leu 
50 ' 130 135 140 

Phe Gly Ser Gly Pro He Leu Gly Ser Phe Pro lie Pro Glu Thr Thr 
145 150 155 160 

Phe Thr Ala Phe Ala Cys Pro Val Gly Arg Tyr His Lys Asp Asn Ser 
165 170 175 

5 5 Gly Leu Asn Leu Lys lie Glu Asp Gin Phe Arg Arg Ala Phe Pro Ser 

180 185 190 

Gly Thr Gly Gly Val Lys Ser He Thr Asn Tyr Cys Pro Val Trp lie 

195 200 205 

Pro Leu Ala Glu Ala Lys Lys Gin Gly Phe Ser Asp lie Leu Phe Leu 
60 210 215 220 

Asp Ala Ala Thr Gly Lys Asn He Glu Glu Leu Phe Ala Ala Asn Val 
225 "* 230 235 240 



WO 00/40695 



PCT/US00/00466 



542 



Phe 


Met 


Leu 


Lys 


Gly 


Asn 


Val 


Val 


Ser 


Thr 


Pro 


Thr 


He 


Ala 


Gly 


Thr 










245 










250 










255 




He 


Leu 


Pro 


Gly 


Val 


Thr 


Arg 


Asn 


Cys 


Val 


Met 


Glu 


Leu 


Cys 


Arg 


Asp 








260 










265 










270 






Phe 


Gly 


Tyr 


Gin 


Val 


Glu 


Glu 


Arg 


Thr 


He 


Pro 


Leu 


Val 


Asp 


Phe 


Leu 






275 










280 










285 








Asp 


Ala 


Asp 


Glu 


Ala 


Phe 


Cys 


Thr 


Gly 


Thr 


Ala 


Ser 


He 


Val 


Thr 


Ser 




290 










295 










300 










He 


Ala 


Ser 


Val 


Thr 


Phe 


Lys 


Asp 


Lys 


Lys 


Thr 


Gly 


Phe 


Lys 


Thr 


Gly 


305 










310 










315 










320 


Glu 


Glu 


Thr 


Leu 


Ala 


Ala 


Lys 


Leu 


Tyr 


Glu 


Thr 


Leu 


Ser 


Asp 


He 


Gin 










325 










330 










335 




Thr 


Gly 


Arg 


Val 


Glu 


Asp 


Thr 


Lys 


Gly 


Trp 


Thr 


Val 


Glu 


He 


Asp 


Arg 



340 345 350 



15 Gin Gly 



(2) INFORMATION FOR SEQ ID NO: 418: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 amino acids 
20 (B) TYPE: amino acid 

tC) STRANDEDNESS : 

• D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
2 5 -;A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 320 

(D) OTHER INFORMATION: / Ceres Seq. ID 2029809 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 
Met Tyr Val Ala Lys Cys Asn His Gly Glu Ser Phe Gin Glu Gly Lvs 
30 1 5 10 15 

He Leu Pro Phe Ala Asp Leu Gin Leu Asn Pro Cys Ala Ala Val Leu 

20 25 30 

Gin Tyr Gly Gin Gly Leu Tyr Glu Gly Leu Lys Ala Tyr Arg Thr Glu 
35^ 40 45 

35 Asp Gly Arg He Leu Leu Phe Arg Pro Asp Gin Asn Gly Leu Arg Leu 
50^ 55 60 

Gin Ala Glv Ala Asp Arg Leu Tyr Met Pro Tyr Pro Ser Val Asp Gin 
65 70 75 80 

Phe Val Ser Ala He Lys Gin Val Ala Leu Ala Asn Lys Lys Trp He 
4 0 85 " 90 95" 

Pro Pro Pro Gly Lys Gly Thr Leu Tyr He Arg Pro He Leu Phe Gly 

100 105 110 

Ser Gly Pro He Leu Gly Ser Phe Pro He Pro Glu Thr Thr Phe Thr 
115 120 125 

4 5 Ala Phe Ala Cys Pro Val Gly Arg Tyr His Lys Asp Asn Ser Gly Leu 
130 135 140 

Asn Leu Lys He Glu Asp Gin Phe Arg Arg Ala Phe Pro Ser Gly Thr 
145 " 150 155 160 

Gly Gly Val Lys Ser lie Thr Asn Tyr Cys Pro Val Trp He Pro Leu 
50 ^ 165 170 175 

Ala Glu Ala Lys Lys Gin Gly Phe Ser Asp He Leu Phe Leu A.sp Ala 

180 185 190 

Ala Thr Gly Lys Asn He Glu Glu Leu Phe Ala Ala Asn Val Phe Met 
195 200 205 

55 Leu Lys Gly Asn Val Val Ser Thr Pro Thr He Ala Gly Thr lie Leu 
210 215 220 

Pro Gly Val Thr Arg Asn Cys Val Met Glu Leu Cys Arg Asp Phe Gly 
225 ^ 230 235 240 

Tyr Gin Val Glu Glu Arg Thr He Pro Leu Val Asp Phe Leu Asp Ala 
60 245 250 255 

Asp Glu Ala Phe Cys Thr Gly Thr Ala Ser He Val Thr Ser He Ala 
260 265 270 
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Ser Val Thr Phe Lys Asp Lys Lys Thr Gly Phe Lys Thr Gly Glu Glu 

275 280 285 

Thr Leu Ala Ala Lys Leu Tyr Glu Thr Leu Ser Asp lie Gin Thr Gly 

290 295 300 

Arg Val Glu Asp Thr Lys Gly Trp Thr Val Glu lie Asp Arg Gin Gly 
305 
i ? \ 



310 

INFORMATION FOR SEQ ID' NO : 4 1 9 : 



315 



320 



:ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



linear 

DNA >. genomic) 



(D) TOPOLOGY: 
MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .98 7 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 9 : 



:D 2030038 



::cct tctctaccaa 



(xi) 
at atccgaaa 
ggccgccatt 
ccctagattc 
ozcctizzccz 
gccga~ aaca 
cgggt ccact 
agct cagaaa 
attctctctc 
atacttgaac 
cagtggggta 

acagcctttg tctgccatta 
cattcctcct atcttcgata 



g-aatcgccg acgggttacg 
ctccgggatc gtcctccggg 



tacacttgga 
gttttgttga 
atcagaattt 
gttctctttt 
tacatgttca 
(2) 



gactcaaaaa cctatcgtca tttatcttca acaggcggaa taacggagat 
acagctctca ctctccgctc tcctgtttat ctcccatcat ctgccacaag 
catggcttca ccaatcaacc accaccagct cgrctcttct trccrcttaa 
cccaaaaccc raaacccatc cgaatttccg catccgcttc 
acaccaatcc tccagacgga gaagtcaaca gctcggtcai: cgacactcac 
cggtctctcg ctactctcgc ggcnttagca arcgctgtaa ccagagtcct 
ctctctctcg caatccagac ttcaagtccc 
agtaccgccg gacctgtctt cttcgcgtct 

acgccgctga cggttgttgc ggtggggata aagaagtggc tagacattta 
ttgatggtta gggttttgct gagttggttc cctaatatcc cttgggaaag 
gagatctctg tgatccttac ttgaatctct tcagaaacat 
cgcttgatgt tagtccattg cctgctttcg cggttcttgg 
tcgattgttc atggcagcac tgggtagaaa ttggaagact tgagctaatg 
tgggaaattt ctgcaacttg tttccnaga ttagggaaaa 
tggatgctcg atgagagttg tgagcti cgtttttgtt 
tctgaataat gagattttgg gttgttgtit atctcctaac 
tattgat 



aacgttcaat 
gatgctgtgt 
tgactttgat 
gactaccgt t 
INFORMATION FOR SEQ ID NO: 420: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .251 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO:420: 
Tyr Leu Lys Asp Ser Lys Thr Tyr Arg 
1 5 
lie Thr Glu Met Ala Ala 
20 

Tyr Leu Pro Ser Ser Ala 
35 

Gin Pro Pro Pro Ala Ara 
50 

Leu Ser lie Gin Asn Pro 
65 70 
Pro lie Thr Thr Pro lie 
85 



(ii) 
(ix) 



(xi ) 



ID 2030039 



His 
10 

lie Thr Ala Leu Thr 
25 

Thr Ser Pro Arg Phe 

4 0 

Leu Phe Phe Pro Leu 
55 

Lvs Ser lie Arg lie 
7 5 

Leu Gin Thr Glu Lys 
90 



Gly 



Leu Ser Ser Thr Gly 
15 

Leu Arg Ser Pro Val 
30 

Gly Phe Thr 
45 

Pro Phe Pro 



His 



p.sn 



Asn 
60 

Ser Ala Ser Ala 



Ser Thr Ala Arg 
95 

Ser Thr Leu Thr Gly Ser Thr Arg Ser Leu Ala Thr Leu Ala Ala 

100 105 110 

Ala lie Ala Val Thr Arg Val Leu Ala Gin Lys Leu Ser Leu Ala 



Ser 

Ser 

80 

Ser 



He 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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115 120 125 

Gin Thr Ser Ser Pro Val lie Ala Asp Gly Leu Arg Phe Ser Leu Ser 

130 135 140 

Thr Ala Gly Pro Val Phe Phe Ala Ser Leu Arg Asp Arg Pro Pro Gly 
5 145 150 155 160 

Tyr Leu Asn Thr Pro Leu Thr Val Val Ala Val Gly lie Lys Lys Trp 

165 170 175 

Leu Asp lie Tyr Ser Gly Val Leu Met Val Arg Val Leu Leu Ser Trp 
180 185 190 

10 Phe Pro Asn lie Pro Trp Glu Arg Gin Pro Leu Ser Ala lie Arg Asp 
195 ^ 200 205 

Leu Cys Asp Pro Tyr Leu Asn Leu Phe Arg Asn lie lie Pro Pro lie 

210 215 220 

Phe Asp Thr Leu Asp Val Ser Pro Leu Leu Ala Phe Ala Val Leu Gly 
15 225 ^ 230 235 240 

Thr Leu Gly Ser lie Val His Gly Ser Thr Gly 

245 250 
(2) INFORMATION FOR SEQ ID NO: 421: 
(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

2 5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..232 

(D) OTHER INFORMATION: / Ceres Seq. ID 2030040 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 
30 Met Ala Ala lie Thr Ala Leu Thr Leu Arg Ser Pro Val Tyr Leu Pro 
15 10 15 

Ser Ser Ala Thr Ser Pro Arg Phe His Gly Phe T hr Asn Gin Pro Pro 

20 25 30 

Pro Ala Arg Leu Phe Phe Pro Leu Asn Pro Phe Pro Ser Leu Ser lie 
35 35 40 45 

Gin Asn Pro Lys Ser lie Arg lie Ser Ala Ser Ala Ser Pro lie Thr 

50 55 60 

Thr Pro lie Leu Gin Thr Glu Lys Ser Thr Ala Arg Ser Ser Thr Leu 
65 70 75 80 

4 0 Thr Gly Ser Thr Arg Ser Leu Ala Thr Leu Ala Ala Leu Ala lie Ala 

85 90 95 

Val Thr Arg Val Leu Ala Gin Lys Leu Ser Leu Ala lie Gin Thr Ser 

100 105 110 

Ser Pro Val lie Ala Asp Gly Leu Arg Phe Ser Leu Ser Thr Ala Gly 
45 115 120 125 

Pro Val Phe Phe Ala Ser Leu Arg Asp Arg Pro Pro Gly Tyr Leu Asn 

130 135 " ^ 140 

Thr Pro Leu Thr Val Val Ala Val Gly lie Lys Lys Trp Leu Asp lie 
145 150 155 160 

50 Tyr Ser Gly Val Leu Met Val Arg Val Leu Leu Ser Trp Phe Pro Asn 

165 170 175 

lie Pro Trp Glu Arg Gin Pro Leu Ser Ala lie Arg Asp Leu Cys Asp 

180 185 190 

Pro Tyr Leu Asn Leu Phe Arg Asn lie lie Pro Pro lie Phe Asp Thr 
55 195 200 205 

Leu Asp Val Ser Pro Leu Leu Ala Phe Ala Val Leu Gly Thr Leu Gly 

210 215 220 

Ser lie Val His Gly Ser Thr Gly 
225 230 
60 (2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(ii) 
(ix) 



(xi) 



ID 2031778 



545 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .755 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 422: 
ccgcagaaaa ccctattgct cccctttgct caagctcagt gcctctttgc agcgaaagct 
caaccaacaa gacgatgatt atctcagaga acaatcgcag agagatctgc aaataccttt 
tcaaagaggg agtatgcttt gctaagaagg atttcaatct cccaaagcat ccgttgattg 
atgtaccaaa cctgcaagtg attaagctca tgcagagttt caagtccaag gagtacgtta 
gggagacatt tgcgtggatg cattattatt ggtttctgac taatgaagga attgagttct 
tgagaactta tcttaacctt ccttccgatg ttgtccctgc tactttgaag aagtctgcta 
agcccggtgg tcgtcccttt ggtggcccac ctggtgatcg ccaaagagga ccacctcgct 
ctgatggaga ccgtcccaga tttggtgacc gtgatggata ccgtggaggc ccacgtggtg 
gtgatgagaa gggtggagct ccagctgatt tccagccgtc tttccaagga ggtggtggta 
ggcctggttt tggccgtggt gcaggcggtt acagtgcagc agcaccatct ggttcagggt 
tcccttgaaa aatttgttgt catattgcga cgatggaagg acagttttgt tttttgttct 
agttctgttt gtgtaatgca aatctggaat ctataatcta tctattacct tcaccttggt 
ttaaacgaac aaatcctgta gtttcaggat tagtg 
(2) INFORMATION FOR SEQ ID NO: 423: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1. .201 

(D) OTHER INFORMATION: / Ceres Seq. ID 2031779 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 
Ala Glu Asn Pro lie Ala Pro Leu Cys Ser Ser Ser Val Pro Leu Cys 
15 10 15 

Ser Glu Ser Ser Thr Asn Lys Thr Met lie lie Ser Glu Asn Asn Arg 

20 25 30 

Arg Glu lie Cys Lys Tyr Leu Phe Lys Glu Gly Val Cys Phe Ala Lys 

35 40 45 

Lys Asp Phe Asn Leu Pro Lys . His Pre Leu lie Asp Val Pro Asn Leu 

50 55 60 

Gin Val lie Lys Leu Met Gin Ser Phe Lys Ser Lys Glu Tyr Val Arg 
65 70 75 80 

Glu Thr Phe Ala Trp Met His Tyr Tyr Trp Phe Leu Thr Asn Glu Gly 

85 90 95 

lie Glu Phe Leu Arg Thr Tyr Leu Asn Leu Pro Ser Asp Val Val Pro 

100 105 110 

Ala Thr Leu Lys Lys Ser Ala Lys Pro Gly Gly Arg Pro Phe Gly Gly 

115 120 125 

Pro Pro Gly Asp Arg Gin Arg Gly Pro Pro Arg Ser Asp Gly Asp Arg 

130 135 140 

Pro Arg Phe Gly Asp Arg Asp Gly Tyr Arg Gly Gly Pro Arg Gly Gly 
145 150 155 160 

Asp Glu Lys Gly Gly Ala Pro Ala Asp Phe Gin Pro Ser Phe Gin Gly 

165 170 175 

Gly Gly Gly Arg Pro Gly Phe Gly Arg Gly Ala Gly Gly Tyr Ser Ala 

180 185 190 

Ala Ala Pro Ser Gly Ser Gly Phe Pro 

195 ' 200 

(2) INFORMATION FOR SEQ ID NO: 42 4: 
(i) SEQUENCE CHARACTERISTICS: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1..177 

(D^ OTHER INFORMATION: / Ceres Seq. ID 2031780 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

Met lie lie Ser Glu Asn Asn Arg Arg Glu He Cys Lys Tyr Leu Phe 
1 5 10 " 15 

.Lys Glu Gly Val Cys Phe Ala Lys Lys Asp Phe Asn Leu Pro Lys His 
20 25 30 

15 Pro Leu lie Asp Val Pro Asn Leu Gin Val lie Lys Leu Met Gin Ser 
35 40 45 

Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp Met His Tyr 

50 55 60 

Tyr Trp Phe Leu Thr Asn Glu Gly lie Glu Phe Leu Arg Thr Tyr Leu 
20 65 70 75 80 

Asn Leu Pro Ser Asp Val Val Pro Ala Thr Leu Lys Lys Ser Ala Lys 

85 90 95 

Pro Gly Gly Arg Pro Phe Gly Gly Pro Pro Gly Asp Arg Gin Arg Gly 
100 105 110 

2 5 Pro Pro Arg Ser Asp Gly Asp Arg Pro Arg Phe Gly Asp Arg Asp Gly 
115 120 125 

Tyr Arg Gly Gly Pro Arg Gly Gly Asp Glu Lys Gly Gly Ala Pro Ala 

130 135 140 

Asp Phe Gin Pro Ser Phe Gin Gly Gly Gly Gly Arg Pro Gly Phe Gly 
30 145 150 155 160 

Arg Gly Ala Gly Gly Tyr Ser Ala Ala Ala Pro Ser Gly Ser Gly Phe 
165 170 175 

Pro 

35 (2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 132 

45 (D) OTHER INFORMATION: / Ceres Seq. ID 2031781 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 
Met Gin Ser Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp 
15 10 15 

Met His Tyr Tyr Trp Phe Leu Thr Asn Glu Gly lie Glu Phe Leu Arg 
5 0 20 2 5 30 

Thr Tyr Leu Asn Leu Pro Ser Asp Val Val Pro Ala Thr Leu Lys Lys 

35 40 45 

Ser Ala Lys Pro Gly Gly Arg Pro Phe Gly Gly Pro Pro Gly Asp Arg 
50 55 60 

55 Gin Arg Gly Pro Pro Arg Ser Asp Gly Asp Arg Pro Arg Phe Gly Asp 
65 70 75 80 

Arg Asp Gly Tyr A.rg Gly Gly Pro Arg Gly Gly Asp Glu Lys Gly Gly 

85 90 95^ 

Ala Pro Ala Asp Phe Gin Pro Ser Phe Gin Gly Gly Gly Gly Arg Pro 
60 100 105 110 

Gly Phe Gly Arg Gly Ala Gly Gly Tyr Ser Ala Ala Ala Pro Ser Gly 
115 120 125 
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Ser Gly Phe Pro 
130 

(2) INFORMATION FOR SEQ ID NO: 426: 
(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 538 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic; 
10 (ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: I . . 538 

( D) OTHER INFORMATION: / Ceres Seq. ID 2032723 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 6: 

15 aaagcaataa aaaacaaaaa tggcgtttac tccgaagatc atcacatgcc tcattgtcct 60 
tacgatctac atggcatccc caacagagtc aaccatccag tgcgggacag tgacgagcac 120 
actggcacag tgcctgacct acttgaccaa cagrggtcca ttgccatcac aatgctgcgt 180 
gggagtcaag tcattgtacc aattggctca gaccacaccg gaccgtaaac aagtatgtga 240 
gtgccttaaa ctagcgggta aagaaatcaa gggcctcaac accgaccttg tggccgcact 300 

20 tcctaccact tgtggtgttt caattcccta ccccatcagt tttagcacca attgcgacag 360 
tatatcgact gccgtgtgaa agaggctagt gatcagatgt acgacraatc aaacttgcca 420 
gcttttaacc taattaaata aaagtattct gcttatattt cccattttan gattttatct 480 
tcttatctat gtaaccacac gatttcatat gctaataatg acaacggatc tttctctc 
(2) INFORMATION FOR SEQ ID NO: 4 27: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
30 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 119 

(D) OTHER INFORMATION: / Ceres Seq. ID 2032724 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427~: 

Met Ala Phe Thr Pro Lys lie lie Thr Cys Leu He Val Leu Thr He 
1 5 10 15 

Tyr Met Ala Ser Pro Thr Glu Ser Thr He Gin Cys Gly Thr Val Thr 
20 25 30 

4 0 Ser Thr Leu Ala Gin Cys Leu Thr Tyr Leu Thr Asn Ser Gly Pro Leu 
35 40 45 

Pro Ser Gin Cys Cys Val Gly Val Lys Ser Leu Tyr Gin Leu Ala Gin 

5 0 55 60 

Thr Thr Pro Asp Arg Lys Gin Val Cys Glu Cys Leu Lys Leu Ala Gly 
45 65 70 75 80 

Lys Glu He Lys Gly Leu Asn Thr Asp Leu Val Ala Ala Leu Pro Thr 

85 90 95 

Thr Cys Gly Val Ser He Pro Tyr Pro He Ser Phe Ser Thr Asn Cys 
100 105 110 

50 Asp Ser He Ser Thr Ala Val 
115 

(2) INFORMATION FOR SEQ ID NO: 4 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
55 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

60 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 102 

(D) OTHER INFORMATION: / Ceres Seq. ID 2032725 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
Met Ala Ser Pro Thr Glu Ser Thr lie Gin Cys Gly Thr Val Thr Ser 
15 10 15 

Thr Leu Ala Gin Cys Leu Thr Tyr Leu Thr Asn Ser Gly Pro Leu Pro 
5 20 25 30 

Ser Gin Cys Cys Val Gly Val Lys Ser Leu Tyr Gin Leu Ala Gin Thr 

35 40 45 

Thr Pro Asp Arg Lys Gin Val Cys Glu Cys Leu Lys Leu Ala Gly Lys 
50 55 60 

10 Glu lie Lys Gly Leu Asn Thr Asp Leu Val Ala Ala Leu Pro Thr Thr 
65 70 75 80 

Cys Glv Val Ser lie Pro Tyr Pro lie Ser Phe Ser Thr Asn Cys Asp 

85 -90 95 

Ser lie Ser Thr Ala Val 
15 100 

(2) INFORMATION FOR SEQ ID NO : 4 2 9 : 
(i) SEQUENCE CHARACTERISTICS: 

\A) LENGTH: 7 5 amino acids 
■3) TYPE: amino acid 
2 0 STRANDEDNESS : 

;D) TOPOLOGY: linear 
•:ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

A> NAME /KEY : peptide 
2 5 .3) LOCATION: 1 . . 75 

iD) OTHER INFORMATION: / Ceres Seq. ID 203272 6 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 2 9 : 
Lys Ala lie Lys Asn Lys Asn Gly Val Tyr Ser Glu Asp His His Met 
1 5 10 15 

30 Pro His Cys Pro Tyr Asp Leu His Gly lie Pro Asn Arg Val Asn His 

20 25 30 

Pro Val Trp Asp Ser Asp Glu His Thr Gly Thr Val Pro Asp Leu Leu 

35~ 4 0 4 5 

Asp Gin Gin Trp Ser lie Ala lie Thr Met Leu Arg Gly Ser Gin Val 
35 50 55 60 

lie Val Pre lie Gly Ser Asp His Thr Gly Pro 
65 70 75 

{2} INFORMATION FOR SEQ ID MO: 430: 
(i) SEQUENCE CHARACTERISTICS: 
4 0 .'A) LENGTH: 198 base pairs 

(3) TYPE: nucleic acid 
iZ) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
4 5 (ix) FEATURE: 

(A) NAME /KEY : - 

(5) LOCATION: 1 . . 198 

{ D) OTHER INFORMATION: / Ceres Seq. ID 2035536 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 
50 atgtcgttgg natggttgga agcgatgttg cctctcggaa tcatcggtgg gatgetctgt 60 
atcatgggca attctcagta ctacatccac aaagcttatc atggccgtcc taagcacatc 120 
ggccacgatg aatgggatgt tgctatggaa agacgegaca agaaagtcgt cgagaaagct 180 
gcagctcctt cctcatga 

(2) INFORMATION FOR SEQ ID NO: 4 31: 
55 (i) SEQUENCE CHARACTERISTICS: 

V 'A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY: linear 
60 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 
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(B) LOCATION: 1 . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 2035537 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 





Met 


Ser 


Leu 


Val 


Trp 


Leu 


Glu Ala Met 


Leu 


Pro 


Leu 


Gly 


He 


He 


Gly 


5 


1 








5 






10 










15 






Gly 


Met 


Leu 


Cys 
20 


He 


Met 


Gly Asn Ser 

25 


Gin 


Tyr 


Tyr 


He 


His 
30 


Lys 


Ala 




Tyr 


His 


Gly 

35 


Arg 


Pro 


Lys 


His He Gly 
40 


His 


Asp 


Glu 


Trp 
45 


Asp 


Val 


Ala 


10 


Met 

Ser 

65 


Glu 

50 


Arg 


Arg 


Asp 


Lys 


Lys Val Val 
55 


Glu 


Lys 


Ala 
60 


Ala 


Ala 


Pro 


Ser 




(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 432 : 

















15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
2 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: I . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 2035538 





(xi) 


SEQUENCE DESCRIPTION: ; 


SEQ 


ID NO: 4 32 












Cys 


Arg 


Trp 


Tyr Gly 


Trp Lys Arg 


Cys 


Cys 


Leu 


Ser 


Glu 


Ser 


Ser 


Val 


1 






5 






10 










15 




Gly 


Cys 


Ser 


Val Ser 


Trp Ala He 


Leu 


Ser 


Thr 


Thr 


Ser 


Thr 


Lys 


Leu 








20 




25 










30 






He 


Met 


Ala 


Val Leu 


Ser Thr Ser 


Ala 


Thr 


Met 


Asn 


Gly 


Met 


Leu 


Leu 






35 




40 










45 








Trp 


Lys 


Asp 


Ala Thr 


Arg Lys Ser 


Ser 


Arg 


Lys 


Leu 


Gin 


Leu 


Leu 


Pro 



50 55 60 



His 
35 65 

(2) INFORMATION FOR SEQ ID NO: 433: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 
4 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
4 5 (B) LOCATION: 1..57 

(D) OTHER INFORMATION: / Ceres Seq. ID 2035539 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 
Met Leu Pro Leu Gly He He Gly Gly Met Leu Cys He Met Gly Asn 
15 10 15 

50 Ser Gin Tyr Tyr He His Lys Ala Tyr His Gly Arg Pro Lys His lie 

20 25 30 

Gly His Asp Glu Trp Asp Val Ala Met Glu Arg Arg Asp Lys Lys Val 

35 40 45 

Val Glu Lys Ala Ala Ala Pro Ser Ser 
55 50 ' 55 

(2) INFORMATION FOR SEQ ID NO: 4 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
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1 








5 










10 










15 




Lys 


Val 


Pro 


Val 


Asn 


Ser 


Lys 


Thr 


His 


Leu 


Gin 


Leu 


His 


Gly 


Glu 


Leu 








20 










25 










30 






Asp 


Thr 


Gly 


Thr 


Gly 


Ala 


Pro 


Ser 


Tyr 


Phe 


Cys 


Ala 


Met 


He 


Arg 


His 






35 










40 










45 








Phe 


Phe 


Pro 


Glu 


Ala 


Ser 


Thr 


Gly 


Leu 


Gly 


Val 


Gly 


Leu 


His 


Tyr 


Asp 




50 










55 










60 










Lys 


Arg 


Gin 


Lys 


Leu 


Arg 


Cys 


Leu 


Val 


Arg 


Gly 


Lys 


Lys 


Glu 


Phe 


Pro 


65 










70 










75 










80 


Val 


Arg 


Ala 


Asp 


Lys 


Arg 


Val 


Thr 


Phe 


Asn 


lie 


Lys 


Gly 


Arg 


Cys 


Asp 










85 










90 










95 




He 


Asp 


Gin 


Asp 


Leu 


Asn 


Gin 


Lys 


Asn 


Pro 


Lys 


Gly 


Ala 


Ala 


Glu 


Phe 








100 










105 










110 






Ala 


Trp 


Asn 


He 


Met 


Asp 


Phe 


Lys 


Glu 


Asp 


Gin 


Asp 


Val 


Arg 


He 


Lys 






115 










120 










125 








Val 


Gly 


Tyr 


Glu 


Met 


Phe 


Asp 


Lys 


Val 


Pro 


Tyr 


Met 


Gin 


He 


Arg 


Glu 




130 










135 










140 










Asn 


Asn 


Trp 


Thr 


Leu 


Asn 


Ala 


Asn 


Met 


Lys 


Gly 


Lys 


Trp 


Asn 


Leu 


Arg 


145 










150 










155 










160 


Tyr 


Asp 


Leu 





























(2) INFORMATION FOR SEQ ID MO: 4 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

30 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . .15 4 

( D) OTHER INFORMATION: / Ceres Seq. ID 2035578 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 
Met Lys lie His Ala Lys Glu Lys Val Pro Val Asn Ser Lys Thr His 
35 1 5 10 15 

Leu Gin Leu His Gly Glu Leu Asp Thr Gly Thr Gly Ala Pro Ser Tyr 

20 25 30 

Phe Cys Ala Met He Arg His Phe Phe Pro Glu Ala Ser Thr Gly Leu 
35 40 45 

4 0 Gly Val Gly Leu His Tyr Asp Lys Arg Gin Lys Leu Arg Cys Leu Val 
50 55 60 

Arg Gly Lys Lys Glu Phe Pro Val Arg Ala Asp Lys Arg Val Thr Phe 
65 7 0 7 5 8 0 

Asn He Lys Gly Arg Cys Asp He Asp Gin Asp Leu Asn Gin Lys Asn 
4 5 " " 85^ ' 90 95 

Pro Lys Gly Ala Ala Glu Phe Ala Trp Asn lie Met Asp Phe Lys Glu 

100 105 110 

Asp Gin Asp Val Arg He Lys Val Gly Tyr Glu Met Phe Asp Lys Val 
115 120 125 

50 Pro Tyr Met Gin He Arg Glu Asn Asn Trp Thr Leu Asn Ala Asn Met 
130 135 140 

Lys Gly Lys Trp Asn Leu Arg Tyr Asp Leu 
145 " 150 

(2) INFORMATION FOR SEQ ID NO: 438: 
55 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 



WO 00/40695 



PCT/US00/00466 



552 

( B } LOCATION: 1 . . 750 

(D) OTHER INFORMATION : / Ceres Seq. ID 2036457 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 
atgacgactc agatcagcaa gaagagaaag tttgtagcgg acggtgtgtt ctacgctgaa 60 
5 ttgaatgagg ttcttacaag agagctagca gaggatggtt actctggtgt tgaggttagg 120 
gttactccta tgaggactga gattatcatc agagctacac gtactcagaa tgttctcggt 180 
gagaagggga ggagaattag ggaattgact tcccttgttc agaagagatt caagtttcca 240 
grtgacagrg ttgagcttta tgccgagaag gttaacaaca gaggtctctg tgccatcgct 300 
caggctgagt ctttacgtta caagcttctt ggtggtctcg ctgttcgtag ggcttgctat 360 
10 ggrgngttga ggtttgttat ggagagtgga gctaagggat gcgaggttat cgtgagtgga 420 
aagcttcgtg ctgccagagc caagtctatg aagttcaaag atggctacat ggtgtcatct 480 
ggtcaaccaa ctaaggaata catagactct gcagtgagac atgttttgct tagacaaggt 540 
gtgctgggaa ticaaggtgaa ggttatgctt gattgggacc ctaagggcat atcaggacca 600 
aagacaccat rgcctgatgt tgtgatcatt cattctccta aagaagaaga ggccatcrar 660 
15 gcacctgctc aggttgctgc cccggctgct ctcgtagcag atgcaccact cacagccgna 720 
gactaccctg cgatgatccc agtcgcctaa 
(2) INFORMATION FOR SEQ ID NO: 439: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 9 amino acids 
2 0 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

■ C ; TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
2 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1. .249 

( D) OTHER INFORMATION: / Ceres Seq. ID 2036458 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 

Met Thr Thr Gin lie Ser Lys Lys Arg Lys Phe Val Ala Asp Gly Val 
30 1 5 10 15 

Phe Tyr Ala Glu Leu Asn Glu Val Leu Thr Arg Glu Leu Ala Glu Asp 

20 25 30 

Gly Tyr Ser Gly Val Glu Val Arg Val Thr Pro Met Arg Thr Glu lie 
35 40 45 

35 lie lie Arg Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg 
50 55 60 

Arg lie Arg Glu Leu Thr Ser Leu Val Gin Lys Arg Phe Lys Phe Pro 
65^ 70 75 80 

Val Asp Ser Val Glu Leu Tyr Ala Glu Lys Val Asn Asn Arg Gly Leu 
4 0 85 .90 95~ 

Cys Ala lie Ala Gin Ala Glu Ser Leu Arg Tyr Lys Leu Leu Gly Gly 

100 105 110 

Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu Arg Phe Val Met Glu 
115 120 125 

4 5 Ser Gly Ala Lys Gly Cys Glu Val lie Val Ser Gly Lys Leu Arg Ala 
130 135 140 

Ala Arg Ala Lys Ser Met Lys Phe Lys Asp Gly Tyr Met Val Ser Ser 
145 150 155 160 

Gly Gin Pro Thr Lys Glu Tyr lie Asp Ser Ala Val Arg His Val Leu 
50 " 165 170 175 

Leu Arg Gin Gly Val Leu Gly lie Lys Val Lys Val Met Leu Asp Trp 

180 185 190 

Asp Pro Lys Gly lie Ser Gly Pro Lys Thr Pro Leu Pro Asp Val Val 
195 200 205 

55 lie lie His Ser Pro Lys Glu Glu Glu Ala lie Tyr Ala Pro Ala Gin 
210 215 220 

Val Ala Ala Pro Ala Ala Leu Val Ala Asp Ala Pro Leu Thr Ala Val 
225 230 235 240 

Asp Tvr Pro Ala Met lie Pro Val Ala 
60 1 245 

(2) INFORMATION FOR SEQ ID NO : 4 4 0 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 20 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 2036459 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440: 



Met 


Arg 


Thr 


Glu 


He 


He 


He 


Arg 


Ala 


Thr 


Arg 


Thr 


GI 


r . 


Asn 


Val 


Leu 


1 








5 










10 












15 




Gly 


Glu 


Lys 


Gly 


Arg 


Arg 


He 


Arg 


Glu 


Leu 


Thr 


Ser 


Le 


M 


Val 


Gin 


Lys 








20 










25 












30 






Arg 


Phe 


Lys 


Phe 


Pro 


Val 


Asp 


Ser 


Val 


Glu 


Leu 


Tyr 


Al 


a 


Glu 


Lys 


Val 






35 










40 










45 










Asn 


Asn 


Arg 


Gly 


Leu 


Cys 


Ala 


He 


Ala 


Gin 


Ala 


Glu 


Se 


V 


Leu 


Arg 


Tyr 




50 










55 










60 












Lys 


Leu 


Leu 


Gly 


Gly 


Leu 


Ala 


Val 


Arg 


Arg 


Ala 


Cys 


Ty 




Gly 


Val 


Leu 


65 










70 










75 












8 0 


Arg 


Phe 


Val 


Met 


Glu 


Ser 


Gly 


Ala 


Lys 


Gly 


Cys 


Glu 


V CL 




He 


Val 


Ser 










85 










90 












95 




Gly 


Lys 


Leu 


Arg 


Ala 


Ala 


Arg 


Ala 


Lys 


Ser 


Met 


Lys 


Ph 




Lys 


Asp 


Gly 








100 










105 












110 






Tyr 


Met 


Val 


Ser 


Ser 


Gly 


Gin 


Pro 


Thr 


Lys 


Glu 


Tyr 


II 




Asp 


Ser 


Ala 






115 










120 










12 










Val 


Arg 


His 


Val 


Leu 


Leu 


Arg 


Gin 


Gly 


Val 


Leu 


Gly 


11 




Lys 


Val 


Lys 




130 










135 










140 












Val 


Met 


Leu 


Asp 


Trp 


Asp 


Pro 


Lys 


Gly 


He 


Ser 


Gly 


Pr 




Lys 


Thr 


Pro 


145 










150 










155 












160 


Leu 


Pro 


Asp 


Val 


Val 


He 


He 


His 


Ser 


Pro 


Lys 


Glu 


Gl 




Glu 


Ala 


He 










165 










170 












175 




Tyr 


Ala 


Pro 


Ala 


Gin 


Val 


Ala 


Ala 


Pro 


Ala 


Ala 


Leu 


Va 




Ala 


Asp 


Ala 








180 










185 












190 






Pro 


Leu 


Thr 


Ala 


Val 


Asp 


Tyr 


Pro 


Ala 


Met 


He 


Pro 


Va 




Ala 










195 










200 










2 0 


5 









(2) INFORMATION FOR SEQ ID NO : 4 4 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 549 

(D) OTHER INFORMATION: / Ceres Seq. ID 2:36585 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 1 : 



atggttctca agacggagct ttgtcgtttc agtggacaga agattiaccc aggaagagga 60 
attaggttta tccgatctga ttctcaggtt ttcttgtttc ttaacucaaa atgtaagagg 120 
tacttccata acaagttgaa gccatccaag cttgcatgga ctgccaigta cagaaagcaa 180 
cacaagaagg atgcagcaca agaggctgtg aagagaagga gacgtgccac caagaagcca 24 0 

tactcaaggt ccattgttgg tgctaccttg gaagtaattc agaagaagag agctgagaag 300 
cctgaagttc gtgatgcagc cagggaagct gctctgcgtg agatcaagga aagaatcaaa 360 
aagaccaaag atgaaaagaa ggctaagaag gtggaatttg cttctaagca acagaaggtc 420 
aaggctaatt tccccaaagc tgctgctgca tccaagggtc ctaaggrggt ggtggcaaac 480 
gctgaagagc ttaaagccat cttttctcac tctgcgtctt ttctgcuagt agctactttt 540 
agtagttga 

(2) INFORMATION FOR SEQ ID NO: 442: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

5 (A) NAME/KEY: peptide 

(B) LOCATION: 1. . 182 

(D) OTHER INFORMATION: / Ceres Seq. ID 2036586 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 2 : 





Met 


Val 


Leu 


Lys 


Thr 


Glu 


Leu 


Cys 


Arg 


Phe 


Ser 


Gly 


Gin 


Lys 


He 


Tyr 


10 


1 








5 










10 










15 






Pro 


Gly 


Arg 


Gly 
20 


He 


Arg 


Phe 


He 


Arg 
25 


Ser 


Asp 


Ser 


Gin 


Val 
30 


Phe 


Leu 




Phe 


Leu 


Asn 
35 


Ser 


Lys 


Cys 


Lys 


Arg 
40 


Tyr 


Phe 


His 


Asn 


Lys 
45 


Leu 


Lys 


Pro 


15 


Ser 


Lys 
50 


Leu 


Ala 


Trp 


Thr 


Ala 
55 


Met 


Tyr 


Arg 


Lys 


Gin 

60 


His 


Lys 


Lys 


Asp 




Ala 


Ala 


Gin 


Glu 


Ala 


Val 


Lys 


Arg 


Arg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 




65 










70 










75 










80 




Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 


Ala 


Thr 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Lys 


20 










85 










90 










95 






Arg 


Ala 


Glu 


Lys 
100 


Pro 


Glu 


Val 


Arg 


Asp 
105 


Ala 


Ala 


Arg 


Glu 


Ala 
110 


Ala 


Leu 




Arg 


Glu 


He 
115 


Lys 


Glu 


Arg 


He 


Lys 
120 


Lys 


Thr 


Lys 


Asp 


Glu 
125 


Lys 


Lys 


Ala 


25 


Lys 


Lys 
130 


Val 


Glu 


Phe 


Ala 


Ser 
135 


Lys 


Gin 


Gin 


Lys 


Val 
140 


Lys 


Ala 


Asn 


Phe 




Pro 


Lys 


Ala 


Ala 


Ala 


Ala 


Ser 


Lys 


Gly 


Pro 


Lys 


Val 


Val 


Val 


Ala 


Asn 




145 










150 










155 










160 




Ala 


Glu 


Glu 


Leu 


Lys 


Ala 


He 


Phe 


Ser 


His 


Ser 


Ala 


Ser 


Phe 


Leu 


Leu 


30 


Val 


Ala 


Thr 


Phe 
180 


165 
Ser 


Ser 








170 










175 






(2) 


INFORMATION 


FOR 


SEQ 


ID • 


NO: 4 43: 

















(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 127 

(D) OTHER INFORMATION: / Ceres Seq. ID 2036587 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 3 : 



Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 


Ala 


Ala 


Gin 


Glu 


Ala 


Val 


Lys 


1 








5 










10 










15 




Arg 


Arg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 


Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 








20" 










25 










30 






Ala 


Thr 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Lys 


Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 






35 










40 










45 








Arg 


Asp 


Ala 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 


Arg 


Glu 


He 


Lys 


Glu 


Arg 


He 




50 










55 










60 










Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Val 


Glu 


Phe 


Ala 


Ser 


65 










70 










75 










80 


Lys 


Gin 


Gin 


Lys 


Val 


Lys 


Ala 


Asn 


Phe 


Pro 


Lys 


Ala 


Ala 


Ala 


Ala 


Ser 










85 










90 










95 




Lys 


Gly 


Pro 


Lys 


Val 


Val 


Val 


Ala 


Asn 


Ala 


Glu 


Glu 


Leu 


Lys 


Ala 


He 








100 










105 










110 






Phe 


Ser 


His 


Ser 


Ala 


Ser 


Phe 


Leu 


Leu 


Val 


Ala 


Thr 


Phe 


Ser 


Ser 








115 










120 










125 









(2) INFORMATION FOR SEQ ID NO : 4 4 4 : 
(i) SEQUENCE CHARACTERISTICS: 
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(B) LOCATION: 1 . .116 

(D) OTHER INFORMATION: / Ceres Seq. ID 2039556 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 6 : 
Met Ala Pro Gin Lys Lys Val Asn Lys Tyr Asp Ala Lys Trp Lys Lys 
5 1 5 10 , " 15 

Gin Trp Tyr Gly Ala Gly Leu Phe Phe Glu Gly Ser Glu Gin lie Asn 

20 25 30 

Val Asp Val Phe Lys Lys Leu Glu Lys Arg Lys Val Leu Ser Asn Val 
35 40 45 

10 Glu Lys Ser Gly Leu Leu Ser Lys Ala Glu Gly Leu Gly Leu Thr Leu 
50 55 60 

Ser Ser Leu Glu Lys Leu Lys Val Phe Ser Lys Ala Glu Asp Leu Glv 
65 " 70 75 8 0~ 

Leu Leu Ser Leu Leu Glu Asn Leu Ala Gly Thr Ser Pro Ala Val Leu 
15 85 90 95 

Ala Ser Ala Ala Leu Pro Ala Leu Thr Ser Cys Tyr Cys Ser Ara Gly 

100 105 110 

Val Asp Pro Gly 
115 

20 (2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..37 8 

30 (D) OTHER INFORMATION: / Ceres Seq, ID 2044283 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 
atggcgaatt tgatgatgag attaccaatt agcttgagaa gcttctctgt ttcagcttct 60 
tcatccaacg gttcgccgcc ggtgatcgga ggatctagcg gcggtgtagg accgatgatt 120 
gtggaattac cgttggagaa gatacgaaga ccgttgatgc gaaccagatc caacgatcag 180 

35 aacaaagtga aagagcttat ggatagtatc cgtcaaatcg gtcttcaagt tccgattgat 240 
gtgattgaag ttgatggaac ttactatggg ttctcgggat gtcacagata cgaggcgcat 300 
cagaagctag ggcttccaac tatacgttgc aaaatccgta aaggaacaaa ggaaacatta 360 
aggcatcatc ttcgctga 

(2) INFORMATION FOR SEQ ID NO: 448: 
4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 125 

(D) OTHER INFORMATION: / Ceres Seq. ID 2044284 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 8 : 

Met Ala Asn Leu Met Met Arg Leu Pro lie Ser Leu Arg Ser Phe Ser 
15 10 15 

Val Ser Ala Ser Ser Ser Asn Gly Ser Pro Pro Val lie Gly Gly Ser 
20 25 30 

55 Ser Gly Gly Val Gly Pro Met He Val Glu Leu Pro Leu Glu Lys He 
35 40 45 

Arg Arg Pro Leu Met Arg Thr Arg Ser Asn Asp Gin Asn Lys Val Lys 

50 55 60 

Glu Leu Met Asp Ser He Arg Gin He Gly Leu Gin Val Pro lie Asp 
60 65 70 75 80 

Val He Glu Val Asp Gly Thr Tyr Tyr Gly Phe Ser Gly Cys His Arg 
85 90 95 



WO 00/40695 



PCT/US00/00466 



557 

Tyr Glu Ala His Gin Lys Leu Gly Leu Pro Thr lie Arg Cys Lys lie 

100 105 110 

Arg Lys Gly Thr Lys Glu Thr Leu Arg His His Leu Arg 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 44 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..121 

(D) OTHER INFORMATION: / Ceres Seq. ID 2044285 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 



Met 


Met 


Arg 


Leu 


Pro 


He 


Ser 


Leu 


Arg 


Ser 


Phe 


Ser 


Val 


Ser 


Ala 


Ser 


1 








5 










10 










15 




Ser 


Ser 


Asn 


Gly 
20 


Ser 


Pro 


Pro 


Val 


He 

25 


Gly 


Gly 


Ser 


Ser 


Gly 
30~ 


Gly 


Val 


Gly 


Pro 


Met 

35 


He 


Val 


Glu 


Leu 


Pro 
40 


Leu 


Glu 


Lys 


He 


Arg 

45~^ 


Arg 


Pro 


Leu 


Met 


Arg 
50^ 


Thr 


Arg 


Ser 


Asn 


Asp 
55 


Gin 


Asn 


Lys 


Val 


Lys 
60 


Glu 


Leu 


Met 


Asp 


Ser 


lie 


Arg 


Gin 


He 


Gly 


Leu 


Gin 


Val 


Pro 


He 


Asp 


Val 


lie 


Glu 


Val 


65 










70 










75 










80 


Asp 


Gly 


Thr 


Tyr 


Tyr 
85 


Gly 


Phe 


Ser 


Gly 


Cys 
90 


His 


Arg 


Tyr 


Glu 


Ala 

95 


His 


Gin 


Lys 


Leu 


Gly 
100 


Leu 


Pro 


Thr 


He 


Arg 
105 


Cys 


Lys 


He 


Arg 


Lys 
110 


Gly 


Thr 


Lys 


Glu 


Thr 


Leu 


Arg 


His 


His 


Leu 


Arg 

















115 120 



(2) INFORMATION FOR SEQ ID NO: 450: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 120 

(D) OTHER INFORMATION: / Ceres Seq. ID 2044286 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO: 450 : 










Met 


Arg 


Leu 


Pro 


He 


Ser 


Leu 


Arg 


Ser 


Phe 


Ser 


Val 


Ser 


Ala 


Ser 


Ser 


1 








5 










10 










15 




Ser 


Asn 


Gly 


Ser 


Pro 


Pro 


Val 


He 


Gly 


Gly 


Ser 


Ser 


Gly 


Gly 


Val 


Gly 








20 










25 










30 






Pro 


Met 


He 


Val 


Glu 


Leu 


Pro 


Leu 


Glu 


Lys 


He 


Arg 


Arg 


Pro 


Leu 


Met 






35 










40 










45 








Arg 


Thr 


Arg 


Ser 


Asn 


Asp 


Gin 


Asn 


Lys 


Val 


Lys 


Glu 


Leu 


Met 


Asp 


Ser 




50 










55 










60 










He 


Arg 


Gin 


He 


Gly 


Leu 


Gin 


Val 


Pro 


He 


Asp 


Val 


He 


Glu 


Val 


Asp 


65 










70 










75 










80 


Gly 


Thr 


Tyr 


Tyr 


Gly 


Phe 


Ser 


Gly 


Cys 


His 


Arg 


Tyr 


Glu 


Ala 


His 


Gin 










85 










90 










95 




Lys 


Leu 


Gly 


Leu 


Pro 


Thr 


He 


Arg 


Cys 


Lys 


He 


Arg 


Lys 


Gly 


Thr 


Lys 








100 










105 










110 






Glu 


Thr 


Leu 


Arg 


His 


His 


Leu 


Arg 






















115 










120 


















(2) 


INFORMATION 


FOR 


SEQ 


ID 1 


NO: 451: 

















(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1236 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 1236 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048114 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 1 : 



atgagaagac ctagtcaaat gatgaggctt ctattaacat cctttttcgg tgttattgtt 60 

ggtttcctta tgggtattac ttttccaacc ttgactttaa ctaagatgaa tcttccatcc 120 

acattgtttc cctcgattga tcttgcatac attgaggata aatactctga catatcaaga 180 

caaagactat ttggttcttg gtcttcgaca aaaggcctca aactcaagaa tgacatccci 240 

gaccctccat ataactataa tgacactaag gttgatgata gaacgttcga gctattgcag 300 

atatgggttt cgactaaccc ccgtggtgct gagaggctac caccagatat agtcacgcct 360 

gaatcagatt tttacctccg tcgactgtgg ggcgacccta atgaggattt aacagtcaag 420 

cagcggtatc tagtaacatt tacggttggc tatgatcaga ggaaaaatat agacactgtg 480 

ttgaagaagt tctcagataa cttctctata atgctgtttc actacgatgg ccgggcaagc 540 

gaatgggaag agtttgaatg gtccaagcga gccattcatg tgagcattcg gaaacaaaca 600 

aaatggtggt acgctaagcg atttcttcat cctgacatag ttgcccccra tgaatatatc 660 

ttcatatggg atgaggatct tggcgtggaa cactttgatt cggaaaaata tctggcggtg 720 

gtgaagaagc atggtttgga aaictcacag cctggattag agccatatga agggctcaca 780 

tgggagatga ccaagaaaag agacgacact gaagtccaca agcatgcrga ggaaaggaat 84 0 

gggtggtgca ctgatcccaa tttaccccct tgtgcagcgt ttgtggagat tatggctcct 900 

gttttctccc gcaaggcatg gcgctgtgtg tggcatatga ttcagaacga tttgattcat 960 

ggatggggtc tggactttgc cgttcggaaa tgtgttcaga acgcacacga gaaaattgga 1020 

gttgtagatg ctcaatggat tatacatcaa ggtgttccat cattagggaa tcaaggacaa 1080 

ccagagcaag ggaaacaacc atgggaaggg gtgagagaac gatgcaggag agagtggaca 1140 

atgtttcaag acagattgga tgatgctgaa aaagcttatt ttgaagcatc tgctcacaag 1200 



aatgcttctt cacggcctca cgggaattgg gtatag 
(2) INFORMATION FOR SEQ ID NO: 4 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. .411 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048115 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 



Met 


Arg 


Arg 


Pro 


Ser 


Gin 


Met 


Met 


Arg 


Leu 


Leu 


Leu 


Thr 


Ser 


Phe 


Phe 


1 








5 










10 










15 




Gly 


Val 


He 


Val 


Gly 


Phe 


Leu 


Met 


Gly 


He 


Thr 


Phe 


Pro 


Thr 


Leu 


Thr 








20 










25 










30 






Leu 


Thr 


Lys 


Met 


Asn 


Leu 


Pro 


Ser 


Thr 


Leu 


Phe 


Pro 


Ser 


He 


Asp 


Leu 






35 










40 










45 








Ala 


Tyr 


He 


Glu 


Asp 


Lys 


Tyr 


Ser 


Asp 


He 


Ser 


Arg 


Gin 


Arg 


Leu 


Phe 




50 










55 










60 










Gly 


Ser 


Trp 


Ser 


Ser 


Thr 


Lys 


Gly 


Leu 


Lys 


Leu 


Lys 


Asn 


Asp 


He 


Pro 


65 










70 










75 










80 


Asp 


Pro 


Pro 


Tyr 


Asn 


Tyr 


Asn 


Asp 


Thr 


Lys 


Val 


Asp 


Asp 


Arg 


Thr 


Phe 










85 










90 










95 




Glu 


Leu 


Leu 


Gin 


He 


Trp 


Val 


Ser 


Thr 


Asn 


Pro 


Arg 


Gly 


Ala 


Glu 


Arg 








100 










105 










110 






Leu 


Pro 


Pro 


Asp 


He 


Val 


Thr 


Pro 


Glu 


Ser 


Asp 


Phe 


Tyr 


Leu 


Arg 


Arg 






115 










120 










125 








Leu 


Trp 


Gly 


Asp 


Pro 


Asn 


Glu 


Asp 


Leu 


Thr 


Val 


Lys 


Gin 


Arg 


Tyr 


Leu 




130 










135 










140 










Val 


Thr 


Phe 


Thr 


Val 


Gly 


Tyr 


Asp 


Gin 


Arg 


Lys 


Asn 


He 


Asp 


Thr 


Val 
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145 150 155 160 

Leu Lys Lys Phe Ser Asp Asn Phe Ser lie Met Leu ?he His Tyr Asp 

165 17 0 17 5 

Gly Arg Ala Ser Glu Trp Glu Glu Phe Glu Trp Ser Lys Arg Ala lie 
5 180 185 " 190 

His Val Ser lie Arg Lys Gin Thr Lys Trp Trp Tyr Ala Lys Arg Phe 

195 ~ 200 " " " * 205 

Leu His Pro Asp lie Val Ala Pro Tyr Glu Tyr lie Phe lie Trp Asp 
210 215 220 

10 Glu Asp Leu Gly Val Glu His Phe Asp Ser Glu Lys Tyr Leu Ala Val 
225 " 230 " 235 " 240 

Val Lys Lys His Gly Leu Glu lie Ser Gin Pro Gly Leu Glu Pro Tyr 

245 250 255 

Glu Gly Leu Thr Trp Glu Met Thr Lys Lys Arg Asp Asp Thr Glu Val 
15 260 265 270 

His Lys His Ala Glu Glu Arg Asn Gly Trp Cys Thr Asp Pro Asn Leu 

275 280 285 

Pro Pro Cys Ala Ala Phe Val Glu lie Met Ala Pre Val Phe Ser Arq 
2 90 295 300 

2 0 Lys Ala Trp Arg Cys Val Trp His Met lie Gin Asn Asp Leu lie His 

305 310 315 320 

Gly Trp Gly Leu Asp Phe Ala Val Arg Lys Cys Val Gin Asn Ala His 

32 5 3 30 33 5 

Glu Lys lie Gly Val Val Asp Ala Gin Trp lie lie His Gin Gly Val 
25 340 345 350 

Pro Ser Leu Gly Asn Gin Gly Gin Pro Glu Gin Glv Lys Gin Pro Trp 

355 360 365 

Glu Gly Val Arg Glu Arg Cys Arg Arg Glu Trp Thr Met Phe Gin Asp 
370 375 380 

3 0 Arg Leu Asp Asp Ala Glu Lys Ala Tyr Phe Glu Ala Ser Ala His Lys 

385 390 395 400 

Asn Ala Ser Ser Arg Pro His Gly Asn Trp Val 

405 410 
(2) INFORMATION FOR SEQ ID NO: 4 53: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

4 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .405 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048116 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:453: 

Met Met Arg Leu Leu Leu Thr Ser Phe Phe Gly Val lie Val Gly Phe 
15 10 15 

Leu Met Gly lie Thr Phe Pro Thr Leu Thr Leu Thr Lys Met Asn Leu 
20 25 30 

50 Pro Ser Thr Leu Phe Pro Ser lie Asp Leu Ala Tyr lie Glu Asp Lys 
35 4 0 4 5 

Tyr Ser Asp lie Ser Arg Gin Arg Leu Phe Gly Ser Trp Ser Ser Thr 

50 55 60 

Lys Gly Leu Lys Leu Lys Asn Asp lie Pro Asp Pro Pro Tyr Asn Tyr 
55 65 ~ 70 75 80 

Asn Asp Thr Lys Val Asp Asp Arg Thr Phe Glu Leu Leu Gin lie Trp 

8 5 90 95 

Val Ser Thr Asn Pro Arg Gly Ala Glu Arg Leu Pro Pro Asp lie Val 
100 105 110 

60 Thr Pro Glu Ser Asp Phe Tyr Leu Arg Arg Leu Trp Gly Asp Pro Asn 
115 120 125 

Glu Asp Leu Thr Val Lys Gin Arg Tyr Leu Val Thr Phe Thr Val Gly 
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130 135 140 

Tyr Asp Gin Arg Lys Asn lie Asp Thr Val Leu Lys Lys Phe Ser Asp 
145 ^ " 150 155 160 

Asn Phe Ser lie Met Leu Phe His Tyr Asp Gly Arg Ala Ser Glu Trp 
5 165 17 0 17 5 

Glu Glu Phe Glu Trp Ser Lys Arg Ala lie His Val Ser lie Arg Lys 

180 185 190 

Gin Thr Lys Trp Trp Tyr Ala Lys Arg Phe Leu His Pro Asp lie Val 
195 200 205 

10 Ala Pro Tyr Glu Tyr lie Phe lie Trp Asp Glu Asp Leu Gly Val Glu 
210 215 220 

His Phe Asp Ser Glu Lys Tyr Leu Ala Val Val Lys Lys His Gly Leu 
225 230 235 " "* ~ 240 

Glu lie Ser Gin Pro Gly Leu Glu Pro Tyr Glu Gly Leu Thr Trp Glu 
15 245 250 255 

Met Thr Lys Lys Arg Asp Asp Thr Glu Val His Lys His Ala Glu Glu 

260 " 265 270 

Arg Asn Gly Trp Cys Thr Asp Pro Asn Leu Pro Pro Cys Ala Ala Phe 
275 280 285 

2 0 Val Glu He Met Ala Pro Val Phe Ser Arg Lys Ala Trp Arg Cys Val 
290 295 300 

Trp His Met He Gin Asn Asp Leu He His Gly Trp Gly Leu Asp Phe 
305 310 315 320 

Ala Val Arg Lys Cys Val Gin Asn Ala His Glu Lys He Gly Val Val 
25 " 325 330 " 335 

Asp Ala Gin Trp He He His Gin Gly Val Pro Ser Leu Gly Asn Gin 

340 345 350 

Gly Gin Pro Glu Gin Gly Lys Gin Pro Trp Glu Gly Val Arg Glu Arg 
355 360 "* 365 

30 Cys Arg Arg Glu Trp Thr Met Phe Gin Asp Arg Leu Asp Asp Ala Glu 
370 375 380 

Lys Ala Tyr Phe Glu Ala Ser Ala His Lys Asn Ala Ser Ser Arg Pro 
385 390 395 400 

His Gly Asn Trp Val 
35 " 405 

(2) INFORMATION FOR SEQ ID NO: 4 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids 

(B) TYPE: amino acid 
4 0 (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
45 (B) LOCATION: 1..404 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048117 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 
Met Arg Leu Leu Leu Thr Ser Phe Phe Gly Val He Val Gly Phe Leu 
15 10 15 

50 Met Gly He Thr Phe Pro Thr Leu Thr Leu Thr Lys Met Asn Leu Pro 

20 25 30 

Ser Thr Leu Phe Pro Ser He Asp Leu Ala Tyr He Glu Asp Lys Tyr 

35 4 0 4 5 

Ser Asp lie Ser Arg Gin Arg Leu Phe Gly Ser Trp Ser Ser Thr Lys 
55 50 55 60 

Gly Leu Lys Leu Lys Asn Asp He Pro Asp Pro Pro Tyr Asn Tyr Asn 
65 70 75 80 

Asp Thr Lys Val Asp Asp Arg Thr Phe Glu Leu Leu Gin He Trp Val 
'85 90 95 

60 Ser Thr Asn Pro Arg Gly Ala Glu Arg Leu Pro Pro Asp He Val Thr 

100 105 110 

Pro Glu Ser Asp Phe Tyr Leu Arg Arg Leu Trp Gly Asp Pro Asn Glu 
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10 



15 



20 



25 



30 



35 







115 










120 


Asp 


Leu 


Thr 


Val 


Lys 


Gin 


Arg 


Tyr 




130 










135 




Asp 


Gin 


Arg 


Lys 


Asn 


He 


Asp 


Thr 


145 










150 






Phe 


Ser 


He 


Met 


Leu 


Phe 


His 


Tyr 










165 








Glu 


Phe 


Glu 


Trp 


Ser 


Lys 


Arg 


Ala 








180 










Thr 


Lys 


Trp 


Trp 


Tyr 


Ala 


Lys 


Arg 






195 










200 


Pro 


Tyr 


Glu 


Tyr 


He 


Phe 


He 


Trp 




210 










215 




Phe 


Asp 


Ser 


Glu 


Lys 


Tyr 


Leu 


Ala 


225 










230 






lie 


Ser 


Gin 


Pro 


Gly 


Leu 


Glu 


Pro 










245 








Thr 


Lys 


Lys 


Arg 


Asp 


Asp 


Thr 


Glu 








260 










Asn 


Gly 


Trp 


Cys 


Thr 


Asp 


Pro 


Asn 






275 










280 


Glu 


He 


Met 


Ala 


Pro 


Val 


Phe 


Ser 




290 










295 




His 


Met 


He 


Gin 


Asn 


Asp 


Leu 


He 


305 










310 






Val 


Arg 


Lys 


Cys 


Val 


Gin 


Asn 


Ala 










325 








Ala 


Gin 


Trp 


He 


He 


His 


Gin 


Gly 








340 










Gin 


Pro 


Glu 


Gin 


Gly 


Lys 


Gin 


Pro 






355 










360 


Arg 


Arg 


Glu 


Trp 


Thr 


Met 


Phe 


Gin 




370 










375 




Ala 


Tyr 


Phe 


Glu 


Ala 


Ser 


Ala 


His 


385 










390 






Gly 


Asn 


Trp 


Val 











561 

Leu Val Thr 

Val Leu Lys 
155 

Asp Gly Arg 

170 
He His Val 
185 

Phe Leu His 

Asp Glu Asp 

Val Val Lys 
235 

Tyr Glu Gly 

250 

Val His Lys 
2 65 

Leu Pre Pro 

Arg Lys Ala 

His Gly Trp 
315 

His Glu Lys 
330 

Val Pro Ser 
345 

Trp Glu Gly 

Asp Arg Leu 

Lys Asn Ala 
395 



125 
Phe Thr 
140 

Lys Phe 

Ala Ser 

Ser He 

Pro Asp 
205 
Leu Gly 
22 0 

Lys His 

Leu Thr 

His Ala 

Cys Ala 
285 
Trp Arg 
300 

Gly Leu 

He Gly 

Leu Gly 

Val Arg 
365 
Asp Asp 
380 

Ser Ser 



Val Gly Tyr 
Ser 
Glu 



Arg 
190 
lie 

Val 

Gly 

Trp 

Glu 
27 0 
Ala 

Cys 

Asp 

Val 

Asn 
350 
Glu 

Ala 

Ara 



Asp Asn 
160 
Trp Glu 
175 

Lys Gin 

Val Ala 

Glu His 

Leu Glu 
240 
Glu Met 
255 

Glu Arg 

Phe Val 

Val Trp 

Phe Ala 
320 
Val Asp 
335 

Gin Gly 

Arg Cys 

Glu Lys 

Pro His 
400 



(2) INFORMATION FOR SEQ ID NO: 455: 
(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 649 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
4 5 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 64 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 20482 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

50 ggaatttctc actctctcta tctctcttag ccagctctga tcatttcgat 
tgaaaaggga ggagaaacat ggttgtacgt atcagactgt cgagatttgg 
eggecatttt ttagggttat ggctgetgat agcagatctc caagagaegg 
gaggtcttag gttacttcaa tcctttgcca ggccaggacg gtggtaagag 
aagttcgatc gaattaagta ctggttatct gttggtgctc agecatcaga 

55 cgtctcctat teagatcegg tttacttcct cctcctccaa tggtggctat 
ggtggagcac gagacacacg cccagttgat ccgatgactg gtcgctatgt 
aataaaacag ttaatgccaa tgataaccag cctaaggaag aggatacaga 
gcatgattca ttagecttet gtcategtag cttttcaagt tcactttgtt 
ttgtgtaatg cagcattaga caactgactt gtttcctttg tttggcgata 

60 tgtttggcac tttttgeaga aacggcacat attttgeatt gggatattt 
(2) INFORMATION FOR SEQ ID NO: 456: 
(i) SEQUENCE CHARACTERISTICS: 



71 

ttttttcegg 
atgcaaaaat 
gaagcatctt 
gatgggtctc 
cccggttcaa 
gggacgtaaa 
ggatgeagag 
agecaagagt 
gtcgattata 
aacggcaagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .135 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048272 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 

Met Val Val Arg lie Arg Leu Ser Arg Phe Gly Cys Lys Asn Arg Pro 
1 5 10 ~ " 15 

Phe Phe Arg Val Met Ala Ala Asp Ser Arg Ser Pro Arg Asp Gly Lys 
20 25 30 

15 His Leu Glu Val Leu Gly Tyr Phe Asn Pro Leu Pro Gly Gin Asp Gly 
35 40 45 

Gly Lys Arg Met Gly Leu Lys Phe Asp Arg lie Lys Tyr Trp Leu Ser 

50 55 60 

Val Gly Ala Gin Pro Ser Asp Pro Val Gin Arg Leu Leu Phe Arg Ser 
20 65 ~ 70 75 80 

Gly Leu Leu Pro Pro Pro Pro Met Val Ala Met Gly Arg Lys Gly Gly 

85 90 95 

Ala Arg Asp Thr Arg Pro Val Asp Pro Met Thr Gly Arg Tyr Val Asp 
100 105 110 

2 5 Ala Glu Asn Lys Thr Val Asn Ala Asn Asp Asn Gin Pro Lys Glu Glu 

115 " 120 125 

Asp Thr Glu Ala Lys Ser Ala 
130 135 
(2) INFORMATION FOR SEQ ID NO: 4 57: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
35 {ii} MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .115 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048273 
40 (xi) SEQUENCE DESCRIPTION.: SEQ ID NO: 457: 





Met 


Ala 


Ala 


Asp 


Ser 


Arg 


Ser 


Pro 


Arg 


Asp 


Gly 


Lys 


His 


Leu 


Glu 


Val 




1 








5 










10 










15 






Leu 


Gly 


Tyr 


Phe 
20 


Asn 


Pro 


Leu 


Pro 


Gly 
25 


Gin 


Asp 


Gly 


Gly 


Lys 
30 


Arg 


Met 


45 


Gly 


Leu 


Lys 
35 


Phe 


Asp 


Arg 


He 


Lys 
40 


Tyr 


Trp 


Leu 


Ser 


Val 
45 


Gly 


Ala 


Gin 




Pro 


Ser 
50 


Asp 


Pro 


Val 


Gin 


Arg 
55 


Leu 


Leu 


Phe 


Arg 


Ser 
60 


Gly 


Leu 


Leu 


Pro 




Pro 


Pro 


Pro 


Met 


Val 


Ala 


Met 


Gly 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Asp 


Thr 


50 


65 










70 










75 










80 




Arg 


Pro 


Val 


Asp 


Pro 
85 


Met 


Thr 


Gly 


Arg 


Tyr 
90 


Val 


Asp 


Ala 


Glu 


Asn 
95 


Lys 




Thr 


Val 


Asn 


Ala 


Asn 


Asp 


Asn 


Gin 


Pro 


Lys 


Glu 


Glu 


Asp 


Thr 


Glu 


Ala 



100 105 110 

55 Lys Ser Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 458: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 4 amino acids 
60 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .84 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048274 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



Met 


Gly 


Leu 


Lys 


Phe 


Asp 


Arg 


lie Lys 


Tyr 


Trp 


Leu 


Ser 


Val 


Gly 


Ala 


i 








5 








10 










15 




Gin 


Pro 


Ser 


Asp 


Pro 


Val 


Gin 


Arg Leu 


Leu 


Phe 


Arg 


Ser 


Gly 


Leu 


Leu 








20 








25 










30 






Pro 


Pro 


Pro 


Pro 


Met 


Val 


Ala 


Met Gly 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Asp 






35 










40 








45 








Thr 


Arg 


Pro 


Val 


Asp 


Pro 


Met 


Thr Gly 


Arg 


Tyr 


Val 


Asp 


Ala 


Glu 


Asn 




50 










55 








60 










Lys 


Thr 


Val 


Asn 


Ala 


Asn 


Asp 


Asn Gin 


Pro 


Lys 


Glu 


Glu 


Asp 


Thr 


Glu 


65 










70 








75 










80 


Ala 


Lys 


Ser 


Ala 
























(2) 


INFORMATION 


FOR 


SEQ 


id : 


UO: 4 59: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .285 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048331 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 9 : 
atgaaactgt gtaatcacca tactggaaag tttgtttcgg agagcgtcga agatcagact 60 
gagcaggtac tcaaaaacat gggggagata ttgaaagcta gtggtgctga ttattcctcg 120 
gtggtgaaga caacaatcat gttggctgat ttggctgact tcaagacagt gaacgagata 180 
tatgccaaat acttcccagc tccttctcca gcacgatcga cgtatcaagt tgcagctttg 240 
cctctaaacg ccaagatcga gattgaatgt attgcaacac tctag 
(2) INFORMATION FOR SEQ ID NO: 4 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY : peptide 

(B) LOCATION: 1 . . 94 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048332 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 



Met 


Lys 


Leu 


Cys 


Asn 


His 


His 


Thr 


Gly 


Lys 


Phe 


Val 


Ser 


Glu 


Ser 


Val 


1 








5 










10 










15 




Glu 


Asp 


Gin 


Thr 


Glu 


Gin 


Val 


Leu 


Lys 


Asn 


Met 


Gly 


Glu 


He 


Leu 


Lys 








20 










25 










30 






Ala 


Ser 


Gly 


Ala 


Asp 


Tyr 


Ser 


Ser 


Val 


Val 


Lys 


Thr 


Thr 


He 


Met 


Leu 






35 










40 










45 








Ala 


Asp 


Leu 


Ala 


Asp 


Phe 


Lys' 


Thr 


Val 


Asn 


Glu 


He 


Tyr 


Ala 


Lys 


Tyr 




50 










55 










60 










Phe 


Pro 


Ala 


Pro 


Ser 


Pro 


Ala 


Arg 


Ser 


Thr 


Tyr 


Gin 


Val 


Ala 


Ala 


Leu 


65 










70 










75 










80 


Pro 


Leu 


Asn 


Ala 


Lys 


He 


Glu 


He 


Glu 


Cys 


He 


Ala 


Thr 


Leu 







85 90 
(2) INFORMATION FOR SEQ ID NO: 461: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 68 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048333 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 



Met 


Gly 


Glu 


He 


Leu 


Lys 


Ala 


Ser 


Gly 


Ala 


Asp 


Tyr 


Ser 


Ser 


Val 


Val 


1 








5 










10 










15 




Lys 


Thr 


Thr 


He 


Met 


Leu 


Ala 


Asp 


Leu 


Ala 


Asp 


Phe 


Lys 


Thr 


Val 


Asn 








20 










25 










30 






Glu 


He 


Tyr 


Ala 


Lys 


Tyr 


Phe 


Pro 


Ala 


Pro 


Ser 


Pro 


Ala 


Arg 


Ser 


Thr 






35 










40 










45 








Tyr 


Gin 


Val 


Ala 


Ala 


Leu 


Pro 


Leu 


Asn 


Ala 


Lys 


He 


Glu 


He 


Glu 


Cys 



50 55 60 



He Ala Thr Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 4 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..48 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048334 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:462: 
Met Leu Ala Asp Leu Ala Asp Phe Lys Thr Val Asn Glu He Tyr Ala 
1 5 10 15 

Lys Tyr Phe Pro Ala Pro Ser Pro Ala Arg Ser Thr Tyr Gin Val Ala 

2 0 2 5 30 

Ala Leu Pro Leu Asn Ala Lys He Glu He Glu Cys He Ala Thr Leu 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 463: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 627 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048466 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 



atgatggaca agaatgaaaa ccctagtttt ttgctctctc gaagttccgt gcgactctct 60 

gctcgagcaa gatctcctac cgatacaatg attatctcag aggctaaccg caaagaaatc 120 

tgcaagtacc tcttcaaaga aggagtttgc tttgcgaaga aggatttcaa tcttgctaag 180 

catccgttga tcgatgtccc caacctacaa gtgattaagc ttatgcagag tttcaaatcc 240 

aaggagtatg ttagggagac atttgcctgg atgcattact attggttttt gaccaatgaa 300 

gggatcgagt tcttgagaac ttatcttaat cttccatctg atgttgttcc tgctactttg 360 

aagaagtcag ctaagcctgg tggtcgtcca tttggtggcc cacctggtga tcgctcaaga 420 

ggacctcgcc atgaaggagg agaccgtccc aggtttggtg accgtgatgg gtaccgtgca 480 

ggtcctcgag ctggtggtga gtttggaggt gaaaagggtg gagctcccgc agattaccag 540 

ccatctttcc aaggaagtgg ccgtggtttt ggccgtggtg ctggtggcta cagcgcagct 600 
gcaccatctg gttcaggttt gccttga 
(2) INFORMATION FOR SEQ ID NO: 4 64: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH; 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .208 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048467 
10 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 464: 



Met 


Met 


Asp 


Lys 


Asn 


Glu 


Asn 


Pro 


Ser 


Phe 


Leu 


Leu 


Ser 


Arg 


Ser 


Ser 


1 








5 










10 










15 




Val 


Arg 


Leu 


Ser 


Ala 


Arg 


Ala 


Arg 


Ser 


Pro 


Thr 


Asp 


Thr 


Met 


He 


He 








20 










25 










30 






Ser 


Glu 


Ala 


Asn 


Arg 


Lys 


Glu 


lie 


Cys 


Lys 


Tyr 


Leu 


Phe 


Lys 


Glu 


Gly 






35 










40 










45 








Val 


Cys 


Phe 


Ala 


Lys 


Lys 


Asp 


Phe 


Asn 


Leu 


Ala 


Lys 


His 


Pro 


Leu 


He 




50 










55 










60 










Asp 


Val 


Pro 


Asn 


Leu 


Gin 


Val 


He 


Lys 


Leu 


Met 


Gin 


Ser 


Phe 


Lys 


Ser 


65 










70 










75 










80 . 


Lys 


Glu 


Tyr 


Val 


Arg 


Glu 


Thr 


Phe 


Ala 


Trp 


Met 


His 


Tyr 


Tyr 


Trp 


Phe 



85 90 95 

Leu Thr Asn Glu Gly He Glu Phe Leu Arg Thr Tyr Leu Asn Leu Pro 
100 105 110 

25 Ser Asp Val Val Pro Ala Thr Leu Lys Lys Ser Ala Lys Pro Gly Gly 
115 120 125 



Arg 


Pro 


Phe 


Gly 


Gly 


Pro 


Pro 


Gly 


Asp 


Arg 


Ser 


Arg 


Gly 


Pro 


Arg 


His 




130 










135 










140 










Glu 


Gly 


Gly 


Asp 


Arg 


Pro 


Arg 


Phe 


Gly 


Asp 


A.rg 


Asp 


Gly 


Tyr 


Arg 


Ala 


145 










150 










155 










160 


Gly 


Pro 


Arg 


Ala 


Gly 


Gly 


Glu 


Phe 


Gly 


Gly 


Glu 


Lys 


Gly 


Gly 


Ala 


Pro 










165 










170 










175 




Ala 


Asp 


Tyr 


Gin 


Pro 


Ser 


Phe 


Gin 


Gly 


Ser 


Gly 


Arg 


Gly 


Phe 


Gly 


Arg 








180 










185 










190 






Gly 


Ala 


Gly 


Gly 


Tyr 


Ser 


Ala 


Ala 


Ala 


Pro 


Ser 


Gly 


Ser 


Gly 


Leu 


Pro 






195 










200 










205 









(2) INFORMATION FOR SEQ ID NO:465: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 5 (A) NAME/KEY: peptide 

(B) LOCATION: 1..207 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048468 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465: 





Met 


Asp 


Lys 


Asn 


Glu 


Asn 


Pro 


Ser 


Phe 


Leu 


Leu 


Ser 


Arg 


Ser 


Ser 


Val 


50 


1 








5 










10 










15 






Arg 


Leu 


Ser 


Ala 
20 


Arg 


Ala 


Arg 


Ser 


Pro 
25 


Thr 


Asp 


Thr 


Met 


He 
30 


He 


Ser 




Glu 


Ala 


Asn 
35 


Arg 


Lys 


Glu 


He 


Cys 
40 


Lys 


Tyr 


Leu 


Phe 


Lys 
45 


Glu 


Gly 


Val 


55 


Cys 


Phe 
50 


Ala 


Lys 


Lys 


Asp 


Phe 
55 


Asn 


Leu 


Ala 


Lys 


His 
60 


Pro 


Leu 


He 


Asp 




Val 


Pro 


Asn 


Leu 


Gin 


Val 


He 


Lys 


Leu 


Met 


Gin 


Ser 


Phe 


Lys 


Ser 


Lys 




65 










70 










75 










80 




Glu 


Tyr 


Val 


Arg 


Glu 


Thr 


Phe 


Ala 


Trp 


Met 


His 


Tyr 


Tyr 


Trp 


Phe 


Leu 


60 










85 










90 










95 






Thr 


Asn 


Glu 


Gly 


He 


Glu 


Phe 


Leu 


Arg 


Thr 


Tyr 


Leu 


Asn 


Leu 


Pro 


Ser 



100 105 110 
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Asp 


Val 


Val 
115 


Pro 


Ala 


Thr 


Leu 


Lys 
120 


Lys 


Ser 


Ala 


Lys 


Pro 
125 


Gly 


Gly 


Arg 




Pro 


Phe 
130 


Gly 


Gly 


Pro 


Pro 


Gly 
135 


Asp 


Arg 


Ser 


Arg 


Gly 
140 


Pro 


Arg 


His 


Glu 


5 


Gly 
145 


Gly 


Asp 


Arg 


Pro 


Arg 
150 


Phe 


Gly 


Asp 


Arg 


Asp 
155 


Gly 


Tyr 


Arg 


Ala 


Gly 
160 




Pro 


Arg 


Ala 


Gly 


Gly 
165 


Glu 


Phe 


Gly 


Gly 


Glu 
170 


Lys 


Gly 


Gly 


Ala 


Pro 
175 


Ala 


10 


Asp 


Tyr 


Gin 


Pro 
180 


Ser 


Phe 


Gin 


Gly 


Ser 
185 


Gly 


Arg 


Gly 


Phe 


Gly 
190 


Arg 


Gly 




Ala 


Gly 


Gly 
195 


Tyr 


Ser 


Ala 


Ala 


Ala 
200 


Pro 


Ser 


Gly 


Ser 


Gly 
205 


Leu 


Pro 






(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 4 66: 

















(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

2 0 (ix) FEATURE: 

(A) NAME/KEY; peptide 

(B) LOCATION : 1 . . 17 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2048469 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 



25 


Met 
1 


He 


He 


Ser 


Glu 
5 


Ala 


Asn 


Arg 


Lys 


Glu 
10 


He 


Cys 


Lys 


Tyr 


Leu 
15 


Phe 




Lys 


Glu 


Gly 


Val 


Cys 


Phe 


Ala 


Lys 


Lys 


Asp 


Phe 


Asn 


Leu 


Ala 


Lys 


His 




* 






20 










25 










30 








Pro 


Leu 


He 


Asp 


Val 


Pro 


Asn 


Leu 


Gin 


Val 


He 


Lys 


Leu 


Met 


Gin 


Ser 


30 






35 










40 










45 










Phe 


Lys 
50 


Ser 


Lys 


Glu 


Tyr 


Val 
55 


Arg 


Glu 


Thr 


Phe 


Ala 
60 


Trp 


Met 


His 


Tyr 




Tyr 


Trp 


Phe 


Leu 


Thr 


Asn 


Glu 


Gly 


He 


Glu 


Phe 


Leu 


Arg 


Thr 


Tyr 


Leu 




65 










70 










75 










80 


35 


Asn 


Leu 


Pro 


Ser 


Asp 
85 


Val 


Val 


Pro 


Ala 


Thr 
90 


Leu 


Lys 


Lys 


Ser 


Ala 
95 


Lys 




Pro 


Gly 


Gly 


Arg 
100 


Pro 


Phe 


Gly 


Gly 


Pro 
105 


Pro 


Gly 


Asp 


Arg 


Ser 
110 


Arg 


Gly 




Pro 


Arg 


His 


Glu 


Gly 


Gly 


Asp 


Arg 


Pro 


Arg 


Phe 


Gly 


Asp 


Arg 


Asp 


Gly 


40 






115 










120 










125 










Tyr 


Arg 
130 


Ala 


Gly 


Pro 


Arg 


Ala 
135 


Gly 


Gly 


Glu 


Phe 


Gly 
140 


Gly 


Glu 


Lys 


Gly 




Gly 


Ala 


Pro 


Ala 


Asp 


Tyr 


Gin 


Pro 


Ser 


Phe 


Gin 


Gly 


Ser 


Gly 


Arg 


Gly 




145 










150 










155 










160 


45 


Phe 


Gly 


Arg 


Gly 


Ala 
165 


Gly 


Gly 


Tyr 


Ser 


Ala 
170 


Ala 


Ala 


Pro 


Ser 


Gly 
175 


Ser 



Gly Leu Pro 



(2) INFORMATION FOR SEQ ID NO: 4 67: 
5 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : 1 . . 58 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050485 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

atcttcggaa agtctcattt ctcgatcccc aattcgtgga ttagggttaa aagaaccatt 60 
tttattctcg tcgcgcaaca acaaatccag atcgaaaaag gaagaagaga tcgaaatggc 120 



WO 00/40695 



PCT/US00/00466 



567 

gttgagaagg gtttacagtg aaatcagagg gaagaaggtg acggagcttc caggctatat 180 
caaatcgact ttttcaatgg agaccgtgaa gacctctgtg aagagaggac tcgataacta 240 
caacgaaaaa tacattcaga ccagctccgt tgatcctatc cttcatatct gcttctacgg 300 
catggctttc tcttaccttg tcgctctccc taatgagcgt cgccatcttg agcatcagca 360 
5 gcatgctaag gagcacggtg gtcattgatc tcgtgggaat cgtttcgatc tcgagatgat 420 
tttagggggg ttgctgtgaa atctttctct gcttgatggt gacgacgact caagaattgt 480 
gtcttattgt ttcgttttct tgaattttcc tggataatgt tgacctaaag gaaaaccttt 540 
ctttcgaatt acactccatg atagtcaata attgaagcat catga 
(2) INFORMATION FOR SEQ ID NO: 4 68: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..128 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050486 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 



25 



Ser 


Ser 


Glu 


Ser 


Leu 


He 


Ser 


Arg 


Ser 


Pro 


He 


Arg 


Gly 


Leu 


Gly 


Leu 


1 








5 










10 










15 




Lys 


Glu 


Pro 


Phe 


Leu 


Phe 


Ser 


Ser 


Arg 


Asn 


Asn 


Lys 


Ser 


Arg 


Ser 


Lys 








20 










25^ 










30 






Lys 


Glu 


Glu 


Glu 


He 


Glu 


Met 


Ala 


Leu 


Arg 


Arg 


Val 


Tyr 


Ser 


Glu 


He 






35 










40 










45 








Arg 


Gly 


Lys 


Lys 


Val 


Thr 


Glu 


Leu 


Pro 


Gly 


Tyr 


He 


Lys 


Ser 


Thr 


Phe 




50 










55 










60 










Ser 


Met 


Glu 


Thr 


Val 


Lys 


Thr 


Ser 


Val 


Lys 


Arg 


Gly 


Leu 


Asp 


Asn 


Tyr 


65 










70 










75 










80 


Asn 


Glu 


Lys 


Tyr 


He 


Gin 


Thr 


Ser 


Ser 


Val 


Asp 


Pro 


He 


Leu 


His 


He 










85 










90 










95 




Cys 


Phe 


Tyr 


Gly 


Met 


Ala 


Phe 


Ser 


Tyr 


Leu 


Val 


Ala 


Leu 


Pro 


Asn 


Glu 








100 










105 










110 






Arg 


Arg 


His 


Leu 


Glu 


His 


Gin 


Gin 


His 


Ala 


Lys 


Glu 


His 


Gly 


Gly 


His 






115 










120 










125 









30 



35 

(2) INFORMATION FOR SEQ ID NO: 469: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 
4 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050487 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 69: 

Met 

50 



55 



60 85 90 

(2) INFORMATION FOR SEQ ID NO: 470: 
(i) SEQUENCE CHARACTERISTICS: 



Met 


Ala 


Leu 


Arg 


Arg 


Val 


Tyr 


Ser 


Glu 


He 


Arg 


Gly 


Lys 


Lys 


Val 


Thr 


1 








5 










10 










15 




Glu 


Leu 


Pro 


Gly 


Tyr 


He 


Lys 


Ser 


Thr 


Phe 


Ser 


Met 


Glu 


Thr 


Val 


Lys 








20 










25 










30 






Thr 


Ser 


Val 


Lys 


Arg 


Gly 


Leu 


Asp 


Asn 


Tyr 


Asn 


Glu 


Lys 


Tyr 


He 


Gin 






35 










40 










45 








Thr 


Ser 


Ser 


Val 


Asp 


Pro 


He 


Leu 


His 


He 


Cys 


Phe 


Tyr 


Gly 


Met 


Ala 




50 










55 










60 










Phe 


Ser 


Tyr 


Leu 


Val 


Ala 


Leu 


Pro 


Asn 


Glu 


Arg 


Arg 


His 


Leu 


Glu 


His 


65 










70 










75 










80 


Gin 


Gin 


His 


Ala 


Lys 


Glu 


His 


Gly 


Gly 


His 
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(A) LENGTH : 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : ■ 1. .516 

(D) OTHER INFORMATION: / Ceres Seq . ID 2050708 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

caaccaaaac aaaacataaa aaacaagtgg aagctttaaa acgagaggga gagagcaaaa 60 
atggcgacgt cgggaacgta cgtgacggaa gttccgcraa aaggatcggc cgagaaatac 120 
tacaagaggt ggaagaacga gaaccatgtc ttccctgatg ctatcggcca ccacatccaa 180 
aatgttaccg ttcacgaagg cgaacatgac tctcacgggt ctatcaggag ttggaactac 240 

15 acatgggatg gaaaggagga ggtgttcaag gagagaagag agatagacga tgagaccaaa 300 
acgtrgacgt taagaggact tgagggtcac gtgatggagc agctcaaagt gtacgacgtc 360 
gtctaccaat tcattcccaa atctgaggat acctgcatcg gcaaaatcac tttaatatgg 420 
gagaagcgca acgatgattc cccagaacca agcgggtaca tgaaattcgt caagagcttg 480 
gttgctgaca tgggaaacca cgttagcaaa acttaa 

2 0 ■. 2) INFORMATION FOR SEQ ID NO : 4 7 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 151 amino acids 

(B) TYPE:, amino acid 

(C) STRANDEDNESS: 

2 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .151 

30 (D) OTHER INFORMATION: / Ceres Seq. ID 2050709 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 1: 



35 



Met 


Ala 


Thr 


Ser 


Gly 


Thr 


Tyr 


Val 


Thr 


Glu 


Val 


Pro 


Leu 


Lys 


Gly 


Ser 


1 








5 










10 










15 




Ala 


Glu 


Lys 


Tyr 


Tyr 


Lys 


Arg 


Trp 


Lys 


Asn 


Glu 


Asn 


His 


Val 


Phe 


Pro 








20 










25 










30 






Asp 


Ala 


He 


Gly 


His 


His 


He 


Gin 


Asn 


Val 


Thr 


Val 


His 


Glu 


Gly 


Glu 






35 










40 










45 








His 


Asp 


Ser 


His 


Gly 


Ser 


He 


Arg 


Ser 


Trp 


Asn 


Tyr 


Thr 


Trp 


Asp 


Gly 




50 










55 










60 










Lys 


Glu 


Glu 


Val 


Phe 


Lys 


Glu 


Arg 


Arg 


Glu 


He 


Asp 


Asp 


Glu 


Thr 


Lys 


65 










70 










75 










80 


Thr 


Leu 


Thr 


Leu 


Arg 


Gly 


Leu 


Glu 


Gly 


His 


Val 


Met 


Glu 


Gin 


Leu 


Lys 










85 










90 










95 




Val 


Tyr 


Asp 


Val 


Val 


Tyr 


Gin 


Phe 


He 


Pro 


Lys 


Ser 


Glu 


Asp 


Thr 


Cys 








100 










105 










110 






He 


Gly 


Lys 


He 


Thr 


Leu 


He 


Trp 


Glu 


Lys 


Arg 


Asn 


Asp 


Asp 


Ser 


Pro 






115 










120 










125 








Glu 


Pro 


Ser 


Gly 


Tyr 


Met 


Lys 


Phe 


Val 


Lys 


Ser 


Leu 


Val 


Ala 


Asp 


Met 




130 










135 










140 










Gly 


Asn 


His 


Val 


Ser 


Lys 


Thr 





















40 



45 



50 

145 150 
(2) INFORMATION FOR SEQ ID NO: 472: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1730 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
60 (A) NAME /KEY : - 

(B) LOCATION: 1 . . 1730 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050901 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 



atcgcgtgt t 
aagcatt ctt 
ctatggagat 
accactgccc 



actcctttct aaaggaacca 
gctctccaag ggattcacgg 



acttcagaac atccgtgatc 
tgaaagtaaa aaacagaacc 
cgagaatcac cgaaataagc cttctctcta 
ggcagttgct tctaacaaac atcttagaat 
tctaccttaa tatcggaact 



tcaccctcac 
gacctattct 



ggcaaagaag 
attcctcctc tcctccacta 



ttgtatatca tggaaggctt gacttacttg 
ttccttctca gctgaattct cttgagtacg 
cctacgaaaa cgccaacatt gctcttccag 
aaaaacagag cattgttcga tataaaagct 
gcaggttttg gaagatcata taccgtacaa 
ctcttgtagt cggtcttgtc ttaggcacta 
gaatcaggaa acgatttggc 
cccaaaccct tccaatattc 



tctccgagaa acctcaagcg gactctacag actctcctct 



caaacacttt ggttttcttg ccatacttgc tactcatcgc 



tctatttcct 
r ctggat cat 
actacattgc 
gtt actt cat 
tgtacaagta 
gcctggtctg 
tagacaagaa 
tatccggttt 
tactcccctt 



tgtaggactc tgcttttcat 
tgtcctaatg gctaactctt 
tggaacatct tcagtgacca 
ctctaaagag agtcttccca 



tgcgttggac 
gtttgaggaa 
tgggct teat 
caaacttgag 
tqetcaaget 



gcactt ctga 



ggcaagctct 
rtgtactttt 
ttcttcttgc 
agtactggct 
taaatgagta 



aatcatctac 
cgcctacttt 
cttgagctct 
ggctttcttc 



ettttegcat 
attgatgaac 
cacattcttg 
tctgtctcac 
gtgctcgtaa 
ctcgcaccta 
ttgttctctg 



cttcatgtac ttcttctcaa 
ctcgtgtctg cacaacaagt 



gcttctgtga atagctgett 
gagagacaga ggatatatgt 



agttactgga 
gttgggacgt 



ggtgacgtgt 
ccgtcattgg 



ggaartatcc gagtattaat 
caategcaga gagatctgea 



ttgctaagaa 
tgattaagct 
tgcattatta 



ggatt tcaar 
catgeagagt 



aaacaaggaa 
cagtgect ct 
tgtgat cttg 
aatacct ttt 
ct cccaaagc 
ttcaaatcca 



ttccttccga tgttgtccct 
ttggtggccc acctggtgat 



ttggtttctg actaatgaag 
get actt tga 



t aegtttett 
ttgcagcgaa 
ttcgtccttt 
caaagctagt 
at ccgt tgat 
aggagtacgt 
gaattgagtt 
agaagtctgc 



gagaagcaag aaaaccctat 
agctcaacca acaagacgat 
tatcagatta tctcagagaa 
ggaaccagag ggagtatget 



tgatgtacca 
t agggagaca 
cttgagaact 
raagcccggt 



aacctgeaag 
t ttgcgtgga 
tatcttaacc 
ggtcgtccct 



cgecaaagag gaccacctcg ctctgatgga gaccgtccca 
taccgtggag gcccacgtgg 



gaggtggtgg 

ctggttcagg gttcccttga 



gatttggtga ccgtgatgga 
ctccagctga tttccagccg tctttccaag 
gtgeaggegg ttacagtgca gcagcaccat 
(2) INFORMATION FOR SEQ ID MO: 47 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 57 5 

(D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473: 



tggtgatgag aagggtggag 
taggcctggt tttggccgtg 



ID 2050902 



Arg 


Val 


Leu 


Leu 


Leu 


Ser 


Lys 


Gly 


Thr 


He 


Val 


Tyr 


His 


Gly 


Arg 


Leu 


1 








5 










10 










15 




Asp 


Leu 


Leu 


Glu 


Ala 


Phe 


Leu 


Leu 


Ser 


Lys 


Gly 


Phe 


Thr 


Val 


Pro 


Ser 








20 










25 










30 






Gin 


Leu 


Asn 


Ser 


Leu 


Glu 


Tyr 


Ala 


Met 


Glu 


He 


Leu 


Gin 


Asn 


He 


Arg 






35 










40 










45 








Asp 


Pro 


Tyr 


Glu 


Asn 


Ala 


Asn 


He 


Ala 


Leu 


Pro 


Asp 


His 


Cys 


Pro 


Glu 




50 










55 










60 










Ser 


Lys 


Lys 


Gin 


Asn 


Gin 


Lys 


Gin 


Ser 


He 


Val 


Arg 


Tyr 


Lys 


Ser 


Ser 


65 










70 










75 










80 


Arg 


He 


Thr 


Glu 


He 


Ser 


Leu 


Leu 


Ser 


Ser 


Arg 


Phe 


Trp 


Lys 


He 


He 










85 










90 










95 




Tyr 


Arg 


Thr 


Arg 


Gin 


Leu 


Leu 


Leu 


Thr 


Asn 


He 


Leu 


Glu 


Ser 


Leu 


Val 








100 










105 










110 






Val 


Gly 


Leu 


Val 


Leu 


Gly 


Thr 


He 


Tyr 


Leu 


Asn 


He 


Gly 


Thr 


Gly 


Lys 






115 










120 










125 








Glu 


Gly 


He 


Arg 


Lys 


Arg 


Phe 


Gly 


Leu 


Phe 


Ala 


Phe 


Thr 


Leu 


Thr 


Phe 




130 










135 










140 










Leu 


Leu 


Ser 


Ser 


Thr 


Thr 


Gin 


Thr 


Leu 


Pro 


lie 


Phe 


lie 


Asp 


Glu 


Arg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



145 



150 



155 



160 
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Pro 


lie 


Leu 


Leu 


Arg 
165 


Glu 


Thr 


Ser 


Ser 


ux y 

17 0 


Leu 


Tvr 
i y x 


Arg 


Leu 


Ser 
175 


Ser 


His 


lie 


Leu 


Ala 
180 


Asn 


Thr 


Leu 


Val 


Phe 
185 


Leu 


Pro 


Tvr 
y 


Leu 


Leu 
190 


Leu 


lie 


Ala 


I le 


lie 
195 


T vr 
i yx 


Ser 


Val 


Ser 


Leu 

200 


Twr 
i yx 


Phe 


Leu 


Val 


Gly 
205 


Leu 


Cys 


Phe 


Ser 


Trn 
210 


Gin 


Ala 


Leu 


Ala 


i y x 

215 


Phe 


Val 


Leu 


Val 


lie 

220 


T rn 


lie 


lie 


Val 


j_itr Li 


Met 


Ala 


rio 1 1 




JT 1 1 


Val 


Leu 


Ph p» 
tr i 1 tr 


Leu 


Ser 


Ser 


Leu 


Ala 


Pro 


As n 


225 










230 










235 










240 


Tvr 

i yr 


T 1 p> 
j_ X tr 


A 1 ;=» 


C 7 V 
u x y 


Thr 
9 4 S 

4 J 


O tr X 


O tr x 


v a X 


Thr 
1 111 


T 1 p> 
lie 

? s n 


T <c> 1 1 
Xj tr LI 


T , pi i 
Xj tr U 


A 1 P\ 

r\-L cl 


AJ_a 


Phe 

255 


Phe 




Phe 


Ser 


Gly 
260 


T\/T 

i yx 


Phe 


lie 


Ser 


J-i y s 
265 


Gl n 

Ul Li 


Ser 


Leu 


Pro 


Lys 
270 


T vr 
i yx 


T rn 

I X LJ 


■i-iSU 


Phe 


Met 
275 


T vr 
i y x 


Phe 


Phe 


o tr x 


Mpr 
i v itr l 

280 


T \ 7T 

1 y r 


Lys 


T ■< ? r 

J 


Ala 


Leu 

285 


Asp 


Ala 


Leu 


Leu 


lie 
290 


As n 


UlU 


i yr 


Ser 


Cys 
295 


T ,p* i i 
Xitr Li 


n x o 


Z\ q n 

rtO J 1 


Lys 


Cys 
300 


Leu 


Val 


T rn 

1 X LJ 


Phe 


Glu 


Glu 


Ala 


Ser 


Val 


Asn 


Ser 




Leu 


v dl 


Thr 


rz 1 
oJ-y 


Gl v 


Asp 


Val 


Leu 


30 5 










310 










315 










320 


. .3D 


" "S 




u — y 


Leu 
32 5 


r± x o 


UlU 


Arg 


UiD 


33 0 


T 1 o 

X 1 tr 


T' *r 


Val 


Leu 


Gl v 

Uly 

335 


f-^r g 


p 27 o 


Ser 


Leu 


Val 
340 


Ser 


gi v 

uiy 


Jr 1 1 tr 


1j y s 


L6U 

345 


Ulu 


T w c; 

j_i y o 


Gin 


Gly 


lie 
350 


pro 


Phe 




j-.rg 


O tr X 

355 


Two 


L y s 


1 1 1 X 


Xjeu 


T 

Xj tr U 

360 


Leu 


Pro 


Php 
JT i Itr 


M-l. cl 


C 1 n 
u X 1 1 

365 


A 1 Pi 

n.1 cx 


Ul 1 1 






- i 1 tr 

370 


A 1 3 




L y s 


H "I .a 
nld 


UlU 

375 


Pro 


Thr 
1 1 1 X 


Ar g 


Arg 


T rn 

i rp 
380 


A c; r-i 
no 1 1 


1 \ X 


P r r^i 
JT X u 


p -f- 

o tr X 


T 1 p» 
X X tr 








j_j eu 


v dl 


i-ix y 


P r r\ 
Jr X CJ 


Ph p 


lie 


L\ rn 
t\L LJ 


Leu 


o tr x 


Uj.11 


A ro* 
rVl -1 


Th r 

1 L LX 


385 










390 










395 










400 


Tip 
1 X e 


.Kid 


ulU 


Arg 


O tr X 

4 n r 

4 U J 


Ala 


As n 


1 111 


Php 

ir ne 


Q p r 
4 x u 


Lys 


Leu 


V ct X 


u X U 


P r r>> 
JT X D 

■4 X O 


u x u 


uiy 


v ct 1 


^yo 


Ph o 
JTi Itr 

420 


A 1 a 


L y s 


L y s 


As p 


Pho 

Jr ne 
425 


As n 


Leu 


P r e~\ 
JrlU 


Lys 


IT. X O 

430 


P r r-i 
riu 


LSU 


X X tr 


As p 


V dl 

435 


L IU 


no 1 1 


Leu 


u 1 1 1 


V d X 

440 


Tip 

x x e 


L y s 


Leu 


L v ltr L 


C 1 n 

U X 1 1 

445 


o tr x 


Phe 


Lys 


Ser 


450 


U-LU 


T W -K- 


Val 


Ar cr 


Ul u 

455 


1 i i X 


php 

Jr i itr 


nl cl 


T rn 

i rp 


L v itr l 

460 


His 


T vr 


Tvr 
i yx 


T rn 

i X LJ 


a. I it: 


Tom 
j-j tr Li 


Th r 

1 I IX 


/\s n 


HI n 
u J_ Li 


n 1 v 


T 1 p> 
x x tr 


uX u. 


Phe 
Jr f ie 


Leu 


/\r g 


Thr 

1 I IX 


Twr 

i yr 


ii tr u 


7\ Q f~> 

rA.o 1 1 


Leu 


465 










470 










475 










480 


JTX (J 




jrio LJ 


val 


V ct X 

4 8 5 


tr x cj 


A 1 ^ 
i-iX cl 


T h -r- 


Leu 


4 on 

h y kj 


ijy ^> 


Q p r 
o tr X 


A 1 ^ 
r\X cl 


Lys 


Pro 
riu 

4 95 




uiy 


A r n 




Jr rie 
500 


uiy 


uiy 


Jri O 


Pr o 


uiy 
505 




Arg 


uX 11 


ri.X LJ 


ux y 
510 


P r 
JTX u 


P r r^s 
Jr X u 


Arg 


Q £=i v- 

o e r 


Asp 
515 


ux y 


Asp 


Arg 


Pro 


Arg 

520 


o K ^ 

rne 


bi y 


Asp 


Arg 


Asp 
525 


ux y 


Tyr 


Arg 


Gly 


Gly 
530 


Pro 


Arg 


Gly 


Gly 


Asp 
535 


Glu 


Lys 


Gly 


Gly 


Ala 
540 


Pro 


Ala 


Asp 


Phe 


Gin 


Pro 


Ser 


Phe 


Gin 


Gly 


Gly 


Gly 


Gly 


Arg 


Pro 


Gly 


Phe 


Gly 


Arg 


Gly 


545 










550 










555 










560 


Ala 


Gly 


Gly 


Tyr 


Ser 
5 65 


Ala 


Ala 


Ala 


Pro 


Ser 
570 


Gly 


Ser 


Gly 


Phe 


Pro 
575 





(2) INFORMATION FOR SEQ ID NO: 47 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 535 
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(D) OTHER INFORMATION: / Ceres Seq. ID 2050903 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 
Met Glu lie Leu Gin Asn lie Arg Asp Pro Tyr Glu Asn Ala Asn lie 
1 5 10 15 

5 Ala Leu Pro Asp His Cys Pro Glu Ser Lys Lys Gin Asn Gin Lys Gin 
20 25 30 

Ser lie Val Arg Tyr Lys Ser Ser Arg lie Thr Glu lie Ser Leu Leu 

35 " 4 0 4 5 

Ser Ser Arg Phe Trp Lys lie lie Tyr Arg Thr Arg Gin Leu Leu Leu 
10 50 55 60 

Thr Asn lie Leu Glu Ser Leu Val Val Gly Leu Val Leu Gly Thr lie 
65 70 75 80 

Tyr Leu Asn lie Gly Thr Gly Lys Glu Gly lie Arg Lys Arg Phe Gly 
85 90 95 

15 Leu Phe Ala Phe Thr Leu Thr Phe Leu Leu Ser Ser Thr Thr Gin Thr 

100 105 110 

Leu Pro lie Phe lie Asp Glu Arg Pro lie Leu Leu Arg Glu Thr Ser 

115 120 125 

Ser Gly Leu Tyr Arg Leu Ser Ser His lie Leu Ala Asn Thr Leu Val 
20 130 135 140 

Phe Leu Pro Tyr Leu Leu Leu lie Ala lie lie Tyr Ser Val Ser Leu 
145 150 155 160 

Tyr Phe Leu Val Gly Leu Cys Phe Ser Trp Gin Ala Leu Ala Tyr Phe 
165 170 175 

2 5 Val Leu Val lie Trp lie lie Val Leu Met Ala Asn Ser Phe Val Leu 

180 185 190 

Phe Leu Ser Ser Leu Ala Pro Asn Tyr He Ala Gly Thr Ser Ser Val 

195 200 " 205 

Thr He Leu Leu Ala Ala Phe Phe Leu Phe Ser Gly Tyr Phe He Ser 
30 210 215 220 

Lys Glu Ser Leu Pro Lys Tyr Trp Leu Phe Met Tyr Phe Phe Ser Met 
225 230 ^ 235 ^ 240 

Tyr Lys Tyr Ala Leu Asp Ala Leu Leu He Asn Glu Tyi Ser Cys Leu 
245 250 255 

35 His Asn Lys Cys Leu Val Trp Phe Glu Glu Ala Ser Val Asn Ser Cys 

260 265 270 

Leu Val Thr Gly Gly Asp Val Leu Asp Lys Asn Gly Leu His Glu Arg 

275 280 ' 285 

Gin Arg He Tyr Val Leu Gly Arg Pro Ser Leu Val Ser Gly Phe Lys 
4 0 290 ^ 295 300 

Leu Glu Lys Gin Gly He Arg Phe Leu Arg Ser Lys Lys Thr Leu Leu 
305 ~ 310 315 " ' 320 

Leu Pro Phe Ala Gin Ala Gin Cys Leu Phe Ala Ala Lys Ala Gin Pro 
325 330 335 

4 5 Thr Arg Arg Trp Asn Tyr Pro Ser He Asn Cys Asp Leu Val Arg Pro 

340 345 350 

Phe He Arg Leu Ser Gin Arg Thr He Ala Glu Arg Ser Ala Asn Thr 

355 360 365 

Phe Ser Lys Leu Val Glu Pro Glu Gly Val Cys Phe Ala Lys Lys Asp 
50 370 375 380 

Phe Asn Leu Pro Lys His Pro Leu He Asp Val Pro Asn Leu Gin Val 
385 390 395 400 

He Lys Leu Met Gin Ser Phe Lys Ser Lys Glu Tyr Val Arg Glu Thr 
405 410 415 

5 5 Phe Ala Trp Met His Tyr Tyr Trp Phe Leu Thr Asn Glu Gly He Glu 

420 425 430 

Phe Leu Arg Thr Tyr Leu Asn Leu Pro Ser Asp Val Val Pro Ala Thr 

435 440 445 

Leu Lys Lys Ser Ala Lys Pro Gly Gly Arg Pro Phe Gly Gly Pro Pro 
60 450 455 460 

Gly Asp Arg Gin Arg Gly Pro Pro Arg Ser Asp Gly Asp Arg Pro Arg 
465 470 475 480 
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Phe Gly Asp Arg Asp Gly Tyr Arg Gly Gly Pro Arg Gly Gly Asp Glu 

485 490 495 

Lys Gly Gly Ala Pro Ala Asp Phe Gin Pro Ser Phe Gin Gly Gly Gly 

500 505 510 

Gly Arg Pro Gly Phe Gly Arg Gly Ala Gly Gly Tyr Ser Ala Ala Ala 

515 520 525 

Pro Ser Gly Ser Gly Phe Pro 
530 535 
(2) INFORMATION FOR SEQ ID NO: 4 75: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: I.. 350 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050904 
;xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 5 : 
Met Ala Asn Ser Phe 7 a I Leu Phe Leu Ser Ser Leu Ala Pro Asn Tyr 
15 10 15 

lie Ala Gly Thr Ser Ser Val Thr He Leu Leu Ala Ala Phe Phe Leu 

20 25 30 

Phe Ser Gly Tyr Phe lie Ser Lys Glu Ser Leu Pro Lys Tyr Trp Leu 

35 4 0 4 5 

Phe Met Tyr Phe Phe Ser Met Tyr Lys Tyr Ala Leu Asp Ala Leu Leu 

50 " 55 " ~ 60 

He Asn Glu Tyr Ser Cys Leu His Asn Lys Cys Leu Val Trp Phe Glu 
65 70 75 80 

Glu Ala Ser Val Asn Ser Cvs Leu Val Thr Gly Gly Asp Val Leu Asp 

8 5 " 90 95 

Lys Asn Gly Leu His Glu Arg Gin Arg He Tyr V 1 Leu Gly Arg Pro 

100 105 110 

Ser Leu Val Ser Gly Phe Lys Leu Glu Lys Gin Gly He Arg Phe Leu 

115 120 " ' 125 

Arg Ser Lys Lys Thr Leu Leu Leu Pro Phe Ala Gin Ala Gin Cys Leu 

130 135 140 

Phe Ala Ala Lys Ala Gin Pro Thr Arg Arq Trp Asn Tyr Pro Ser He 
145 150 " 155 ' 160 

Asn Cys Asp Leu Val Arg Pro Phe He Arg Leu Ser Gin Arg Thr He 

165 170 175 

Ala Glu Arg Ser Ala Asn Thr Phe Ser Lys Leu Val Glu Pro Glu Gly 

180 185 190 

Val Cys Phe Ala Lys Lys Asp Phe Asn Leu Pro Lys His Pro Leu He 

195 200 205 

Asp Val Pro Asn Leu Gin Val He Lys Leu Met Gin Ser Phe Lvs Ser 

210 215 220 

Lys Glu Tyr Val Arg Glu Thr Phe Ala Trp Met His Tyr Tyr Trp Phe 
225 230 235 240 

Leu Thr Asn Glu Gly He Glu Phe Leu Arg Thr Tyr Leu Asn Leu Pro 

245 250 255 

Ser Asp Val Val Pro Ala Thr Leu Lys Lys Ser Ala Lys Pro Gly Gly 

260 265 270 

Arg Pro Phe Gly Gly Pro Pro Gly Asp Arg Gin Arg Gly Pro Pro Arg 

275 280 285 

Ser Asp Gly Asp Arg Pro Arg Phe Gly Asp Arg Asp Gly Tyr Arg Gly 

290 295 300 

Gly Pro Arg Glv Gly Asp Glu Lys Gly Gly Ala Pro Ala Asp Phe Gin 
305 310 315 320 

Pro Ser Phe Gin Gly Gly Gly Gly Arg Pro Gly Phe Gly Arg Gly Ala 
325 330 335 
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Gly Gly Tyr Ser Ala Ala Ala Pro Ser Gly Ser Gly Phe Pro 
340 345 350 

(2) INFORMATION FOR SEQ ID NO : 4 7 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..535 

V D) OTHER INFORMATION: / Ceres Seq. ID 2051325 
(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 476: 
gctacataac tcaaagtaac aaacatacac atataacata tagaaagatc gaaaacaaaa 
gatcgggatg agtatggctc caaagacctc aactacactt gctttattcc ttgtgaccaa 
tattctcttc ctcaacctca ttaccctgag ttgcgcagac aatacttgcc caagagacgt 
tctcaaactt tcgacatgct cgaatgttct caacctcatc aacttgaagc tcggggcacc 
agctatgagg ccttgttgct ctattctctt tggtctaatt gatctcgatg ttgctgtttg 
cctttgcacc gcgctcaagc tcagccttct tggcatcacc atcgacactc ctattcacct 
taacttggct cttaacgcct gtggaggtac ccttcctgat ggattccgtt gcccaacata 
gctacaatat attcatattt tctatctatg ttcttgaatt ggttttgtct aacatccaaa 
tttcaaacct aaattcatgt ataagcaaat aacaagtaaa attaagactt ttatg 
(2) INFORMATION FOR SEQ ID NO: 4 77: 
(i) SEQUENCE CHARACTERISTICS: 

i A } LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 117 

(D) OTHER INFORMATION: / Ceres Seq. ID 2051326 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 



Met 


Ser 


Met 


Ala 


Pro 


Lys 


Thr 


Ser 


Thr 


Thr 


Leu 


Ala 


Leu 


Phe 


Leu 


Val 


1 








5 










10 










15 




Thr 


Asn 


He 


Leu 


Phe 


Leu 


Asn 


Leu 


He 


Thr 


Leu 


Ser 


Cys 


Ala 


Asp 


Asn 








20 










25 










30 






Thr 


Cys 


Pro 


Arg 


Asp 


Val 


Leu 


Lys 


Leu 


Ser 


Thr 


Cys 


Ser 


Asn 


Val 


Leu 






35 










40 










45 








Asn 


Leu 


He 


Asn 


Leu 


Lys 


Leu 


Gly 


Ala 


Pro 


Ala 


Met 


Arg 


Pro 


Cys 


Cys 




50 










55 










60 










Ser 


He 


Leu 


Phe 


Gly 


Leu 


He 


Asp 


Leu 


Asp 


Val 


Ala 


Val 


Cys 


Leu 


Cys 


65 










70 










75 










80 


Thr 


Ala 


Leu 


Lys 


Leu 


Ser 


Leu 


Leu 


Gly 


He 


Thr 


He 


Asp 


Thr 


Pro 


He 










85 










90 










95 




His 


Leu 


Asn 


Leu 


Ala 


Leu 


Asn 


Ala 


Cys 


Gly 


Gly 


Thr 


Leu 


Pro 


Asp 


Gly 








100 










105 










110 






Phe 


Arg 


Cys 


Pro 


Thr 




























115 




























(2) 


INFORMATION 


FOR 


SEQ 


id : 


NO: 478 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1..115 

(D) OTHER INFORMATION: / Ceres Seq. ID 2051327 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 8 : 
Met Ala Pro Lys Thr Ser Thr Thr Leu Ala Leu Phe Leu Val Thr Asn 
1 5 10 15 

lie Leu Phe Leu Asn Leu lie Thr Leu Ser Cys Ala Asp Asn Thr Cys 
5 20 25 30 

Pro Arg Asp Val Leu Lys Leu Ser Thr Cys Ser Asn Val Leu Asn Leu 

3 5 4 0 4 5 

lie Asn Leu Lys Leu Gly Ala Pro Ala Met Arg Pro Cys Cys Ser lie 
50 55 60 

10 Leu Phe Gly Leu lie Asp Leu Asp Val Ala Val Cys Leu Cys Thr Ala 
65 70 -75 30 

Leu Lys Leu Ser Leu Leu Gly lie Thr lie Asp Thr Pro lie His Leu 

85 ' 90 95 

Asn Leu Ala Leu Asn Ala Cys Gly Gly Thr Leu Pro Asp Gly Phe Arg 
15 100 105 110 

Cys Pro Thr 
115 

(2) INFORMATION FOR SEQ ID NO : 4 7 9 : 
■i) SEQUENCE CHARACTERISTICS : 
2 0 fA) LENGTH: 74 amino acids 

(3) TYPE: amino acid 

(C) STRAMDEDNESS : 

(D) TOPOLOGY: linear 

. ii) MOLECULE TYPE: peptide 
2 5 ix) FEATURE: 

(A) NAME /KEY : peptide 
(3) LOCATION: 1 . . 74 

(D) OTHER INFORMATION: / Ceres Seq. ID 2051328 
'.xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 
30 Met Leu Leu Phe Ala Phe Ala Pro Arg Ser Ser Ser Ala Phe Leu Ala 
1 5 10 15 

Ser Pro Ser Thr Leu Leu Phe Thr Leu Thr Trp Leu Leu Thr Pro Val 

20 25 30 

Glu Val Pro Phe Leu Met Asp Ser Val Ala Gin His Ser Tyr Asn lie 
35 35 40 45 

Phe He Phe Ser He Tyr Val Leu Glu Leu Val Leu Ser Asn He Gin 

50 55 60 

He Ser Asn Leu Asn Ser Cys He Ser Lys 
65 70 
4 0 (2) INFORMATION FOR SEQ ID NO : 4 8 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
[ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .517 

50 (D) OTHER INFORMATION: / Ceres Seq. ID 2051633 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 
cataaccctc ttggaaagag tctcaacact tgcagagaaa aagaacaagg aagatcccgg 60 
aaaatggcaa cggcgattgt acgttcagct ctttcccgag cagtgactcg cgcagctccg 120 
aagacatccg tcgctcctaa gcgaaacttt tcctcttccg ccggccatga cgatgcttat 180 

55 gaagctgcga agtgggagaa gataacttat ctgggtattg ctagttgcac tgctctagct 240 
gtctatgttt tatccaaggg ccatcatcac ggcgaagacc ctcctgccta tccgcatatg 300 
cacatccgca acaaggagtt tccttggggt ccggatggtc tgtttgaggt gaagcacaac 360 
aaagagcact gagtcttgcg tggtcataat aacgtcttct tggtttattt gaaaggctaa 420 
aatgttttac cgtatttgtt ctcaccgttt gtcaacgatt tgcractcca atctcttttc 480 

60 rtttgttggg aaataaaagt taatactttg cttggtc 
(2) INFORMATION FOR SEQ ID NO : 4 8 1 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..123 

{ D) OTHER INFORMATION: / Ceres Seq. ID 2051634 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 1 : 



His 


Asn 


Pro 


Leu 


Gly 


Lys 


Ser 


Leu 


Asn 


Thr 


Cys 


A.rg 


Glu 


Lys 


Glu 


Gin 


1 








5 










10 










15 




Gly 


Arg 


Ser 


Arg 


Lys 


Met 


Ala 


Thr 


Ala 


He 


Val 


Arg 


Ser 


Ala 


Leu 


Ser 








20 










25 










30 






Arg 


Ala 


Val 


Thr 


Arg 


Ala 


Ala 


Pro 


Lys 


Thr 


Ser 


Val 


Ala 


Pro 


Lys 


Arg 






35 










40 










45 








Asn 


Phe 


Ser 


Ser 


Ser 


Ala 


Gly 


His 


Asp 


Asp 


Ala 


Tyr 


Glu 


Ala 


Ala 


Lys 




50 










5 5 










60 










Trp 


Glu 


Lys 


He 


Thr 


Tyr 


Leu 


Gly 


He 


Ala 


Ser 


Cys 


Thr 


Ala 


Leu 


Ala 


65 










70 










75 










80 


Val 




Val 


Leu 


Ser 


Lys 


Gly 


His 


His 


His 


Gly 


Glu 


Asp 


Pro 


Pro 


Ala 










8 5 










90 










95 




Tyr 


Pro 


His 


Met 


His 


He 


Arg 


Asn 


Lys 


Glu 


Phe 


Pro 


Trp 


Gly 


Pro 


Asp 








100 










105 










110 






Gly 


Leu 


Phe 


Glu 


Val 


Lys 


His 


Asn 


Lys 


Glu 


His 













115 120 



(2) INFORMATION FOR SEQ ID NO: 4 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: I . . 102 

(D) OTHER INFORMATION: / Ceres Seq. ID 2051635 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 



Met 


Ala 


Thr 


Ala 


lie 


Val 


Arg 


Ser 


Ala 


Leu 


Ser 


A.rg 


Ala 


Val 


Thr 


Arg 


1 








5 










10 










15 




Ala 


Ala 


Pro 


Lys 

20 


Thr 


Ser 


Val 


Ala 


Pro 
25 


Lys 


Arg 


Asn 


Phe 


Ser 
30 


Ser 


Ser 


Ala 


Gly 


His 

35 


Asp 


Asp 


Ala 


Tyr 


Glu 
40 


Ala 


Ala 


Lys 


Trp 


Glu 
45 


Lys 


He 


Thr 


Tyr 


Leu 
50 


Gly 


He 


Ala 


Ser 


Cys 

5 5 


Thr 


Ala 


Leu 


Ala 


Val 
60 


Tyr 


Val 


Leu 


Ser 


Lys 


Gly 


His 


His 


His 


Gly 


Glu 


Asp 


Pro 


Pro 


Ala 


Tyr 


Pro 


His 


Met 


His 


65 










70 










75 










80 


He 


Arg 


Asn 


Lys 


Glu 
85 


Phe 


Pro 


Trp 


Gly 


Pro 
90 


Asp 


Gly 


Leu 


Phe 


Glu 
95 


Val 


Lys 


His 


Asn 


Lys 
100 


Glu 


His 






















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 4* 


33: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. . 410 
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(D) OTHER INFORMATION: / Ceres Seq. ID 2051906 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 
ttccttcttt ttcacatcta cccaaaattc caaaacacca tcgatttttt gctctcttga 60 
aatgtgtctt tttatgtcca attcttctct tccaactaaa ccgaaccgga aaaccaggtt 120 
5 cggagatcgg tgtttactga tggccaaaca gcaacgaacc cgactttaca tacttcgacg 180 
atgtgtctcc atgctgcttt gctggcacga ccactctatt tctgattagt ttgattcatg 240 
aaccggacgg accaagaagt agaaatcaga agaatgtgat gtcatttaga gtttttcttg 300 
ttttctttta gaactcttca aggaatcttt tgtaattgaa gaagactcta aggattgggc 360 
cttttgggcc tttgtacata ttgtgtaaat aacatatgaa ctttttttac 
10 (2) INFORMATION FOR SEQ ID NO: 484: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



!ix) FEATURE: 

(A)- NAME /KEY: peptide 
•;E) LOCATION: 1 . . 75 







;D) OTHER 


INFORMATION: 


/ C 


eres Seq. I 


D 2051907 








;xi) 


SEQUENCE DESCRIPTION : S 


:EQ I 


D NO : 4 8 4 : 










Ser 


Phe 


Phe Phe 


Thr 


Ser 


Thr Gin 


Asn 


Ser Lys Thr 


Pro 


Ser 


He 


Phe 


1 






5 








10 






15 




Cys 


Ser 


Leu Glu 


Met 


Cys 


Leu Phe 


Met 


Ser Asn Ser 


Ser 


Leu 


Pro 


Thr 






20 








25 






30 






Lys 


Pro 


Asn Arg 


Lys 


Thr 


Arg Phe 


Gly 


Asp Arg Cys 


Leu 


Leu 


Met 


Ala 






35 






40 






45 








Lys 


Gin 


Gin Arg 


Thr 


Arg 


Leu Tyr 


He 


Leu Arg Arg 


Cys 


Val 


Ser 


Met 




50 








55 




60 










Leu 


Leu 


Cys Trp 


His 


Asp 


His Ser 


He 


Ser Asp 










65 








70 






75 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO : 4 8 5 : 













20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
i'ix) FEATURE: 

4 0 (A) NAME/KEY: peptide 

(B) LOCATION: 1. . 55- 

(D) OTHER INFORMATION: / Ceres Seq. ID 2051908 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:485: 
Met Cys Leu Phe Met Ser Asn Ser Ser Leu Pro Thr Lys Pro Asn Arg 
45 1 5 10 ^ 15 

Lys Thr Arg Phe Gly Asp Arg Cys Leu Leu Met Ala Lys Gin Gin Arg 

2 0 2 5 30 

Thr Arg Leu Tyr He Leu Arg Arg Cys Val Ser Met Leu Leu Cys Trp 
35 40 45 

50 His Asp His Ser He Ser Asp 
50 55 
(2) INFORMATION FOR SEQ ID NO : 4 8 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
55 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

60 (A) NAME /KEY : peptide 

(B) LOCATION: 1. .51 

(D) OTHER INFORMATION: / Ceres Seq. ID 2051909 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 6 : 
Met Ser Asn Ser Ser Leu Pro Thr Lys Pro Asn Arg Lys Thr Arg Phe 
15 10 15 

Gly Asp Arg Cys Leu Leu Met Ala Lys Gin Gin Arg Thr Arg Leu Tyr 
5 " 20 25 30 

lie Leu Arg Arg Cys Val Ser Met Leu Leu Cys Trp His Asp His Ser 

35 " 40 45 

lie Ser Asp 
5 0 

10 (2) INFORMATION FOR SEQ ID NO: 4 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 719 

20 ( D } OTHER INFORMATION: / Ceres Seq. ID 2052403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 87: 
acaaaicatt caaagacaca caaaataatt gagttttttt tttaattaga aacaaarggg 60 
gttgagtggt gttcttcatg tggaggttga ggitaagtct ccggctgaaa agttctgggt 120 
agcccttggc gacggcatca atctcttccc caaagctttc cctaacgact acaaaaccat 180 

25 ccaagttcta gccggcgatg gcaacgctcc tggctccatt cgcctcatta cttatggaga 240 
aggatctcca ctggtgaaga tatcggcgga gaggatcgaa gcagrggatt tggagaacaa 300 
aagcatgtcg tacagcatca ttggcggaga aatgttggag tactacaaaa ccttcaaagg 360 
aaccatcacc gttatcccta aggatggtgg tagccttctg aaatggtctg gtgagtttga 420 
qaaqaccgcc catgagatcq atgatccaca tgtcatcaag gactttgctg tcaagaactt 480 

30 caaagagata gatgagtatc ttcttaagca aactagtgcc taacactaga acctttaaat 540 
tatataagag ggttcgaccg tctctataag atttttctaa ttaagaagtt gaataaagtg 600 
gaaccccttt atgaatatcc acgttttatg atttcggagt ttatgcagcc tagtaggcca 660 
taagcttttt acaaaagcca atttagtcga acattttgaa aaaatcgaac cttgtggta 
(2) INFORMATION FOR SEQ ID NO : 4 8 8 : 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 155 

(D) OTHER INFORMATION: / Ceres Seq. ID 2052404 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 8 : 

Met Gly Leu Ser Gly Val Leu His Val Glu Val Glu Val Lys Ser Pro 
15 10 15 

Ala Glu Lys Phe Trp Val Ala Leu Gly Asp Gly lie Asn Leu Phe Pro 
20 25 30 

50 Lys Ala Phe Pro Asn Asp Tyr Lys Thr lie Gin Val Leu Ala Gly Asp 
35 40 45 

Gly Asn Ala Pro Gly Ser lie Arg Leu lie Thr Tyr Gly Glu Gly Ser 

50 55 60 

Pro Leu Val Lys lie Ser Ala Glu Arq lie Glu Ala Val Asp Leu Glu 
55 65 70 75 80 

Asn Lys Ser Met Ser Tyr Ser lie lie Gly Gly Glu Met Leu Glu Tyr 

85 90 95 

Tyr Lys Thr Phe Lys Gly Thr lie Thr Val lie Pro Lys Asp Gly Gly 
100 105 110 

60 Ser Leu Leu Lys Trp Ser Gly Glu Phe Glu Lys Thr Ala His Glu lie 
115 120 125 

Asp Asp Pro His Val lie Lys Asp Phe Ala Val Lys Asn Phe Lys Glu 



WO 00/40695 



PCT/US00/00466 



578 

130 135 140 

lie Asp Glu Tyr Leu Leu Lys Gin Thr Ser Ala 
145 150 155 

(2) INFORMATION FOR SEQ ID NO : 4 8 9 : 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 MOLECULE TYPE: DMA (genomic) 

'ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION: 1 . .739 

( D } OTHER INFORMATION: / Ceres Seq. ID 205354 5 
15 -.xi; SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 9 : 

attttagtac attgttgacc atctttttcg 

ttaagtcagt aactaggaaa attcagaagc 

gattacagaa tttctaccaa aagagtacgg 

tttccccaac ctctggatgg gtgcrcaagt 
20 gtacccaact :^atatgcaa tagaatcaga 

icagagagga catcaaaact ctttagagat 

cggtgggatg aagcaccctt gtatcrgtac 

attcttccac tttaaaggtt acgctacrgg 

atacggtttc ttggggttgc taggt ctgat 
25 gatccttgct tgagctactc gtttctgggg 

tagaaccaat gcttgtaagc tgtccacaaa 

ttaaaagttt gtaataaatc atggcttcat 

ttaataaaga tttgaatta 

(2) INFORMATION FOR SEQ ID NO: 4 90: 
30 (L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D) TOPOLOGY: linear 
35 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . - 14 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053546 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 



45 



50 



55 



60 



t at agact ac 


tatctctgat 


etcttgegag 


60 


gctctcaat c 


t caaaaatat 


ccatggcggc 


120 


atatgtcgtt 


ct cgt cctcg 


tcttctactg 


180 


cggcagagct 


cgcaaaaggt 


acaacgtccc 


240 


aaacaaagat 


get aagctct 


tcaactgtgt 


300 


gatgccaatg 


tatttcatac 


t gatgatcct 


36C 


t ggcct t ggt 


tigcti:taca 


acgttagccg 


420 


agatcccatg 


aagcgt ctta 


cgat egggaa 


480 


gatatgtacc 


atctcgtttg 


gtgt cact ct 


540 


t taatgattc 


tctggtttgc 


tcgaagaata 


600 


acttgtgtaa 


tactttagag 


tttgtcactt 


660 


agaacagttg 


aaatttcaca 


teegtagacg 


720 



Met 


Ala 


Ala 


He 


Thr 


Glu 


Phe 


Leu 


Pro 


Lys 


Glu 


Tyr 


Gly 


Tyr 


Val 


Val 


1 








5 










10 










15 




Leu 


Val 


Leu 


Val 


Phe 


Tyr 


Cys 


Phe 


Leu 


Asn 


Leu 


Trp 


Met 


Gly 


Ala 


Gin 








20 










25 










30 






Val 


Gly 


Arg 


Al a 


Arg 


Lys 


Arg 


Tyr 


Asn 


Val 


Pro 


Tyr 


Pro 


Thr 


Leu 


Tyr 






35 










40 










45 








Ala 


He 


Glu 


Ser 


Glu 


Asn 


Lys 


Asp 


Ala 


Lys 


Leu 


Phe 


Asn 


Cys 


Val 


Gin 




50 










55 










60 










Arg 


Gly 


His 


Gin 


Asn 


Ser 


Leu 


Glu 


Met 


Met 


Pro 


Met 


Tyr 


Phe 


He 


Leu 


65" 










70 










75 










80 


Met 


He 


Leu 


Gly 


Gly 


Met 


Lys 


His 


Pro 


Cys 


He 


Cys 


Thr 


Gly 


Leu 


Gly 










85 










90 










95 




Leu 


Leu 


Tyr 


Asn 


Val 


Ser 


Arg 


Phe 


Phe 


Tyr 


Phe 


Lys 


Gly 


Tyr 


Ala 


Thr 








100 










105 










110 






Gly 


Asp 


Pro 


Met 


Lys 


Arg 


Leu 


Thr 


He 


Gly 


Lys 


Tyr 


Gly 


Phe 


Leu 


Gly 






115 










120 










125 








Leu 


Leu 


Gly 


Leu 


Met 


He 


Cys 


Thr 


He 


Ser 


Phe 


Gly 


Val 


Thr 


Leu 


He 




130 










135 










140 










Leu 


Ala 






























145 
































(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 491: 

















(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 118 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053547 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 



Met 


Gly 


Ala 


Gin 


Val 


Gly 


Arg 


Ala 


Arg 


Lys 


Arg 


Tyr 


Asn 


Val 


Pro 


Tyr 


1 








5 










10 










15 




Pro 


Thr 


Leu 


Tyr 


Ala 


He 


Glu 


Ser 


Glu 


Asn 


Lys 


Asp 


Ala 


Lys 


Leu 


Phe 








20 










25 










30 






Asn 


Cys 


Val 


Gin 


Arg 


Gly 


His 


Gin 


Asn 


Ser 


Leu 


Glu 


Met 


Met 


Pro 


Met 






35 










40 










4 5 








Tyr 


Phe 


He 


Leu 


Met 


He 


Leu 


Gly 


Gly 


Met 


Lys 


His 


Pro 


Cys 


He 


Cys 




50 










55 










60 










Thr 


Gly 


Leu 


Gly 


Leu 


Leu 


Tyr 


Asn 


Val 


Ser 


Arg 


Phe 


Phe 


Tyr 


Phe 


Lys 


65 










70 










-75 










80 


Gly 


Tyr 


Ala 


Thr 


Gly 


Asp 


Pro 


Met 


Lys 


Ara 


Leu 


Thr 


He 


Gly 


Lys 


Tyr 










85 










90 










95 




Gly 


Phe 


Leu 


Gly 


Leu 


Leu 


Gly 


Leu 


Met 


He 


Cys 


Thr 


He 


Ser 


Phe 


Gly 








100 










105 










110 






Val 


Thr 


Leu 


He 


Leu 


Ala 























115 

(2) INFORMATION FOR SEQ ID NO: 4 92: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 7 4 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053548 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 92: 
Met Met Pro Met Tyr Phe lie Leu Met He Leu Gly Gly Met Lys His 
1 5 10 15 

Pro Cys lie Cys Thr Gly Leu Gly Leu Leu Tyr Asn Val Ser Arg Phe 

20 25 " 30 

Phe Tyr Phe Lys Gly Tyr Ala Thr Gly Asp Pro Met Lys Arg Leu Thr 

35 4 0 4 5 

He Gly Lys Tyr Gly Phe Leu Gly Leu Leu Gly Leu Met lie Cys Thr 

50 55 60 

He Ser Phe Gly Val Thr Leu He Leu Ala 
65 70 
(2) INFORMATION FOR SEQ ID NO: 493: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..681 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053884 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 
ccatccaaaa aaaacaaaac aaaaaattat attcaagaga aaaaggaaaa aatgaatttc 
atctccgatc aggtaaagaa actctcaagc tcaacaccag aggagccaga ccacaacaag 1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



580 

ccagtcgaag gaaccgaaac agctacaaga ccagctacca acgccgagct catggcaagt 
gccaaggttg tagctgaagc tgctcaagcc gcagctcgta acgaatcaga caaactcgac 
aagggtaaag tcgccggagc ctctgctgat atcttagacg ctgccgagaa atacggtaag 
ttcgatgaaa agagtagcac tggtcagtac ctcgacaagg ctgagaagta tctcaacgac 
tacgagtcgt cacactccac cggtgctggt ggtcctcctc ctccgacgag tcaggctgag 
ccagcaagtc agcctgagcc ggcggctaag aaagacgatg aagagtctgg tggtgggctt 
ggaggttatg ccaagatggc tcaaggtttc ttgaagtgat ttgatcttta attgttgttc 
atcattttcg taataataaa ttaaataact agtatcgttt gtgactagtt tatgttgctt 
cgtttatgtt tatggggagt gacgagtgag tgtaataact tctggtgatc atgaatctaa 
tccatctttg ttgtgattat g 
(2) INFORMATION FOR SEQ ID NO: 494: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino. acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

■A) NAME /KEY : peptide 
;B) LOCATION: 1 . .172 
ID) OTHER INFORMATION: 



(ii) 
( ix) 



■; xi ; 
Pro Ser Lys 
1 



Ceres Seq. ID 2053885 
SEQUENCE DESCRIPTION: SEQ ID NO: 4 94: 

Lys Asn Lys Thr Lys Asn Tyr lie Gin Glu Lys Lys Giu 
5 10 15 

Lys Lys Leu Ser Ser Ser Thr 
20 25 " 30 

Pro Glu Glu Pro Asp His Asn Lys Pro Val Glu Gly Thr Glu Thr Ala 

35 40 45 

Thr Arg Pro Ala Thr Asn Ala Giu Leu Met Ala Ser Ala Lys Val Val 

50 55 60 

Ala Glu Ala Ala Gin Ala Ala Ala Arg Asn Glu Ser Asp Lys Leu Asp 
65 7 0 7 5 80 

Lys Gly Lys Val Ala Gly Ala Ser Ala Asp lie Leu Asp Ala Ala Glu 

85 90 95 

Lys Tyr Gly Lys Phe Asp Glu Lys Ser Ser Thr Gly Gin Tyr Leu Asp 

100 105 110 

Lys Ala Glu Lys Tyr Leu Asn Asd Tyr Glu Ser Ser His Ser Thr Gly 
115 120 125 

Gly Pro Pro Pro Pro Thr Ser Gin Ala Glu Pro Ala Ser Gin 
135 140 



Ala Gl\ 



130 

Pro Glu Pro Ala Ala Lys Lys Asp Asp Glu Glu Ser Gly Gly Gly Leu 



145 



160 



(2) 



150 155 
Gly Gly Tyr Ala Lys Met Ala Gin Gly Phe Leu Lys 
165 ' 170 

INFORMATION FOR SEQ ID NO: 4 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 155 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 4 95: 
Met Asn Phe lie Ser Asp Gin Val Lys Lys Leu Ser Ser Ser Thr Pro 
15 10 15 

Glu Glu Pro Asp His Asn Lys Pro Val Glu Gly Thr Glu Thr Ala Thr 

20 25 30 

Arg Pro Ala Thr Asn Ala Glu Leu Met Ala Ser Ala Lys Val Val Ala 
35 40 45 



(ii) 
(ix) 



(xi) 



ID 2053886 



180 
240 
300 
360 
420 
480 
540 
600 
660 
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Glu Ala Ala Gin Ala Ala Ala Arg Asn Glu Ser Asp Lys Leu Asp Lys 

50 55 60 

Gly Lys Val Ala Gly Ala Ser Ala Asp lie Leu Asp Ala Ala Glu Lys 
65 70 75 80 

5 Tyr Gly Lys Phe Asp Glu Lys Ser Ser Thr Gly Gin Tyr Leu Asp Lys 

85 90 ~ 95 

Ala Glu Lys Tyr Leu Asn Asp Tyr Glu Ser Ser His Ser Thr Gly Ala 

100 105 110 

Gly Gly Pro Pro Pro Pro Thr Ser Gin Ala Glu Pro Ala Ser Gin Pro 
10 115 120 125 

Glu Pro Ala Ala Lys Lys Asp Asp Glu Glu Ser Gly Gly Gly Leu Gly 

130 135 140 

Gly Tyr Ala Lys Met Ala Gin Gly phe Leu Lys 
145 150 155 

15 {2} INFORMATION FOR SEQ ID NO: 4 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 
{ B ) TYPE: amino acid 
(C) STRANDEDNESS : 
20 (D) TOPOLOGY: linear 

(ii- MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1 . . 115 

25 (D) OTHER INFORMATION: / Ceres Seq. ID 2053887 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:496: 



30 



Met 


Ala 


Ser 


Ala 


Lys 


Val 


Val 


Ala 


Glu 


Ala 


Ala 


Gin 


Ala 


Ala 


Ala 


Arg 


1 








5 










10 










15 




Asn 


Glu 


Ser 


Asp 


Lys 


Leu 


Asp 


Lys 


Gly 


Lys 


Val 


Ala 


Gly 


Ala 


Ser 


Ala 








20 










25 










30 






Asp 


He 


Leu 


Asp 


Ala 


Ala 


Glu 


Lys 


Tyr 


Gly 


Lys 


Phe 


Asp 


Glu 


Lys 


Ser 






35 










40 










45 








Ser 


Thr 


Gly 


Gin 


Tyr 


Leu 


Asp 


Lys 


Ala 


Glu 


Lys 


Tyr 


Leu 


Asn 


Asp 


Tyr 




50 










55 










60 










Glu 


Ser 


Ser 


His 


Ser 


Thr 


Gly 


Ala 


Gly 


Gly 


Pro 


Pro 


Pro 


Pro 


Thr 


Ser 


65 










70 










75 










80 


Gin 


Ala 


Glu 


Pro 


Ala 


Ser 


Gin 


Pro 


Glu 


Pro 


Ala 


Ala 


Lys 


Lys 


Asp 


Asp 










85 










90 










95 




Glu 


Glu 


Ser 


Gly 


Gly 


Gly 


Leu 


Gly 


Gly 


Tyr 


Ala 


Lys 


Met 


Ala 


Gin 


Gly 








100 










105 










110 






Phe 


Leu 


Lys 





























35 



40 

Phe Leu 

115 

(2) INFORMATION FOR SEQ ID NO : 4 97 : 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 0 (ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 797 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053908 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 

55 ttctctcaaa atccataaaa agagagagag ataaataaag agaaaactaa agaagctaga 60 
agatggagaa aagtaatgat catgacaagg ctagccacgg cggctccggt ggtggtgcca 120 
cggagaagtg ggaggagact agcctcggaa tccqaactgc cqagacaatg ctccggttag 180 
ctccggtggg cctttgtgtt gcagcacttg ttgtcatgct taaggactct gagactaatg 240 
agttcggttc aatttcttac tccaatctca cagcctttag gtacttggtg cacgcaaatg 300 

60 gaatatgtgc aggctactct cttctatcag cagccattgc agcgatgcct cgttcttctt 360 
cgacaatgcc tcgtgtttgg accttctttt gtctcgacca gcttctgacc tacctggttc 420 
ttgctgctgg agctgtatca gctgaggttc tatacttggc ttacaatgga gactcagcca 480 
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ttacttggag cgatgcatgt agctcctatg gcggtttctg tcatagagcc actgcttctg 540 
ttataatcac attctttgtg gtttgtttct acatcgttct ctctctaatc tcctcttata 600 
agctctttac tcgctttgat cctccttcca ttgttgactc cgccaagaat ctcgaagtcg 660 
ctgtcttcgg aagttagatc ctccatttgc tcttcaagta catctagttt gtcatgtttc 720 
5 aaatgttgtg tgtttttaac tttggtcaag agaaagaatg cttatgtgtt tgctcttctt 780 
tcattgcttt tcttcct 

(2) INFORMATION FOR SEQ ID NO: 4 98: 
;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
iii) MOLECULE TYPE: peptide 
;ix) FEATURE: 

15 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 20 4 

{ D ) OTHER INFORMATION: / Ceres Seq. ID 2053909 
xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 98: 

Met Glu Lvs Ser Asn Asp His Asp Lvs Ala Ser His Glv Giy Ser Glv 
20 1 5 10 15 

GLy Giy Ala Thr Glu Lys Trp Gla Glu Thr Ser Leu Giy lie Arg Thr 
20 25 30 

Thr Met Leu Arg Leu Ala Pro Val Glv Leu Cvs Vai Ala Ala 



35 40 45 

2 5 Leu Val Val Met Leu Lys Asp Ser Glu Thr Asn Glu Phe Giy Ser lie 
50 55 60 

Ser Tyr Ser Asn Leu Thr Ala Phe Arg Tyr Leu Val His Ala Asn Giy 
65 7 0 75 8 0 

lie Cvs Ala Giy Tyr Ser Leu Leu Ser Ala Ala lie Ala Ala Met Pro 
30 85 90 95 

Arg Ser Ser Ser Thr Met Pro Arg Val Trp Thr Phe Phe Cys Leu Asp 

100 105 110 

Gin Leu Leu Thr Tyr Leu Val Leu Ala Ala Giy Ala Val Ser Ala Glu 
115 120 125 

35 Val Leu Tyr Leu Ala Tyr Asn Giy Asp Ser Ala lie Thr Trp Ser Asp 
130 135 140 

Ala Cvs Ser Ser Tyr Giy Giy Phe Cys His Arg Ala Thr Ala Ser Val 
145 150 155 160 

lie lie Thr Phe Phe Val Val Cys Phe Tyr lie Val Leu Ser Leu lie 
40 165 170 175 

Ser Ser Tyr Lys Leu Phe Thr Arg Phe Asp Pro Pro Ser He Val Asp 

180 " 185 190 

Ser Ala Lys Asn Leu Glu Val Ala Val Phe Giy Ser 
195 200 
4 5 (2; INFORMATION FOR SEQ ID NO: 4 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 169 

55 (D) OTHER INFORMATION: / Ceres Seq. ID 2053910 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 
Met Leu Arg Leu Ala Pro Val Giy Leu Cys Val Ala Ala Leu Val Val 
1 5 10 15 

Met Leu Lvs Asp Ser Glu Thr Asn Glu Phe Giy Ser lie Ser Tyr Ser 
60 20 25 30 

Asn Leu Thr Ala Phe Arg Tyr Leu Val His Ala Asn Giy lie Cys Ala 
3 5 4 0 4 5 
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Gly Tyr Ser Leu Leu Ser Ala Ala lie Ala Ala Met Pro Arg Ser Ser 

50 55 60 

Ser Thr Met Pro Arg Val Trp Thr Phe Phe Cys Leu Asp Gin Leu Leu 
65 70 75 80 

5 Thr Tyr Leu Val Leu Ala Ala Gly Ala Val Ser Ala Glu Val Leu Tyr 

85 90 95 

Leu Ala Tyr Asn Gly Asp Ser Ala lie Thr Trp Ser Asp Ala Cys Ser 

100 105 110 

Ser Tyr Gly Gly Phe Cys His Arg Ala Thr Ala Ser Val lie lie Thr 
10 115 120 125 

Phe Phe Val Val Cys Phe Tyr lie Val Leu Ser Leu lie Ser Ser Tyr 

130 135 140 

Lys Leu Phe Thr Arg Phe Asp Pro Pro Ser lie Val Asp Ser Ala Lys 
145 150 155 160 

15 Asn Leu Glu Val Ala Val Phe Gly Ser 

165 

(2) INFORMATION FOR SEQ ID NO: 500: 
(i) SEQUENCE CHARACTERISTICS: 

fA) LENGTH : 153 amino acids 
20 ( B ) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

25 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . .153 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053911 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:500: 
Met Leu Lys Asp Ser Glu Thr Asn Glu Phe Gly Ser lie Ser Tyr Ser 
30 1 5 10 15 

Asn Leu Thr Ala Phe Arg Tyr Leu Val His Ala Asn Gly lie Cys Ala 

20 25 30 

Gly Tyr Ser Leu Leu Ser Ala Ala lie Ala Ala Met Pro Arg Ser Ser 
35 4 0 4 5 

35 Ser Thr Met Pro Arg Val Trp Thr Phe Phe Cys Leu Asp Gin Leu Leu 
50 55 60 

Thr Tyr Leu Val Leu Ala Ala Gly Ala Val Ser Ala Glu Val Leu Tyr 
65 70 75 80 

Leu Ala Tyr Asn Gly Asp Ser Ala lie Thr Trp Ser Asp Ala Cys Ser 
40 85 90 95 

Ser Tyr Gly Gly Phe Cys His Arg Ala Thr Ala Ser Val lie lie Thr 

100 105 110 

Phe Phe Val Val Cys Phe Tyr lie Val Leu Ser Leu lie Ser Ser Tyr 
115 120 125 

4 5 Lys Leu Phe Thr Arg Phe Asp Pro Pro Ser lie Val Asp Ser Ala Lys 

130 ~ 135 140 

Asn Leu Glu Val Ala Val Phe Gly Ser 
145 150 
(2) INFORMATION FOR SEQ ID NO : 5 0 1 : 

5 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 5 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 652 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056123 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:501: 

attttttctt tctctctctc ctccctaagc aaaactaaaa caagctatgg ctggtatgct 60 
tcccggagtt gagtgtgcaa ggaggcggcg cttccacggt ggtgctcctc cgattgaatc 120 
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20 25 30 

Gly His Val Trp Thr Arg Arg Pro Ser Phe Ser Leu Tyr Thr Thr Asn 

35 40 45 

His Glu Ser His Gin Ala His Val Ser Phe Ser Glu Arg Ser Val Arg 
5 50 55 60 

Asn Lys Ser Tyr Gly Glu Asp Asn Asp Glu Lys Leu Asp Gly Ala Ala 
65 70 75 80 

Lys Glu Ala Lys Gin Arg Leu Asn Lys Arg Leu Arg lie Pro Pro Arg 
85 90 95 

10 Thr Arg Gin Asn Gly Lys Asp Lys Gly Asn Lys Leu Glu Gin Gly Lys 

100 J 105 110 

Gly Lys Pro Leu Gly Asp Leu Pro Thr Glu Val Val Gly Leu Lys Lys 

115 120 125 

Ser Arg Gly Arg Leu Met Glu Trp Phe Lys Arg Arg Val Arg Glu Gin 
15 130 135 140 

Gin Asp Cys Ala lie Cys Leu Asp Arg Phe Lys Lys Gly Glu Thr Leu 
145 150 155 160 

Val His Leu Pro Cys Ala His Lys Phe His Ser lie Cys Leu Leu Pro 
165 170 175 

2 0 Trp Leu Asp Thr Asn Val Tyr Cys Pro Tyr Cys Arg Thr Asp lie Trp 

180 185 190 

Asn 

(2) INFORMATION FOR SEQ ID MO: 504: 
25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .2619 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056245 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 

atggagcaag agaaaagctt ggatccacaa ctatggcatg cttgtgcagg atcaatggtt 60 
caaatccctt cactgaattc aacggttttt tacttcgctc aaggccacac agagcacgct 120 
cacgcgcctc ctgattttca cgcgccgcgc gttccacctc ttatcctctg tcgtgtcgtc 180 
tccgtgaagt tcctcgccga cgctgaaaca gacgaagttt ttgctaaaat tacgcttttg 240 

40 ccacttccgg gaaacgactt ggatctagaa aacgacgccg ttttgggtct aactcctcct 300 
tcttctgacg gtaacggtaa cggtaaagag aaaccggcgt ctttcgctaa aacgttaacg 360 
cagtctgacg ctaataacgg cggtggtttc tccgttccac gttattgcgc cgagacgatt 420 
ttcccgcggc ttgattactc ggcggagcca ccggttcaaa ccgtgattgc taaagacatc 480 
cacggcgaga cttggaaatt ccggcatatt tacagaggaa cacctcgccg tcatctccta 540 

4 5 accaccggtt ggagcacttt cgttaaccag aagaaactaa tcgccggaga ctcaatcgtc 600 
ttcctccgtt ctgaatccgg tgacctctgc gtcggaatcc gccgcgctaa acgcggcggt 660 
ctcggatcta acgcaggatc cgacaatcct taccctggat tctccggttt cctccgtgac 720 
gacgagtcaa caacaacaac atcgaagcta atgatgatga aacgcaacgg aaacaacgac 780 
ggaaacgccg cggctacagg gagggttaga gtagaagcag tagcggaagc ggtggcgcgt 840 

50 gcagcgtgtg gacaagcgtt tgaggttgtt tattatccac gcgctagtac accggagttt 900 
tgcgtaaaag cagctgatgt tagatcagca atgaggataa gatggtgtag tggtatgcgt 960 
tttaaaatgg cgtttgaaac agaggattct tctagaatca gttggtttat gggtactgtc 1020 
tccgccgttc aagtcgctga tccaattcgt tggcctaatt caccatggcg tctccttcag 1080 
gtagcttggg acgaaccgga tttgttacaa aacgttaagc gggttagtcc gtggttagtc 1140 

55 gaattggtat cgaacatgcc tacaatacat ttatctccat tctctccgag gaagaagatt 1200 
aggattccgc agccatttga gtttccattc cacggtacta aattcccgat tttctccccg 1260 
ggattcgcca acaatggcgg tggcgagtcc atgtgttatc tgtcaaacga caacaataat 1320 
gctcctgcag gaatacaggg agccaggcaa gctcaacaac tcttcggatc accatctccg 1380 
tctttgttgt ctgatctcaa tcttagtagt tacaccggta acaacaagtt acattctccg 1440 

60 gcgatgtttc tatcgagttt caacccgagg catcatcatt atcaggctag ggatagtgag 1500 
aatagtaata acatttcgtg ttctttaact atggggaatc ctgctatggt tcaggataag 1560 
aagaagtctg ttggttcggt taagactcat cagttcgtgt tgttcggtca accgatttta 1620 



WO 00/40695 



PCT/US00/00466 



586 

accgaacagc aagttatgaa ccgaaaacgg tttttggaag aagaggcgga agcggaggag 
gagaaaggtt tagtggctcg tgggttaaca tggaattata gtttgcaagg acttgagacg 
ggtcattgta aagttttcat ggaatctgag gatgttggac gcacactcga tctctcggtt 
attggctcgt accaagaatt gtaccggaaa ttggctgaga tgtttcatat agaagagagg 
tcggatttgt tgactcatgt tgtgtaccgg gatgcaaatg gtgttatcaa acgtattgga 
gacgaacctt tcagtgattt catgaaagca actaaacggc taacaatcaa gatggatatt 
ggtggcgaca acgtgagaaa catttttagt ttccttcaaa taatattttt tttctttttt 
ttcaaaaaat taattttttt tttgttcttt tccgatggtg gatactatac tcatatacat 
atatattcat cagattatat atatatatat atatatatat atataagtaa tataaaaaaa 
aaacttattt tgttcattct tttaagttta aaaacagtca attcatttat atcatctatg 
atgaagaaac tcattcaact ctcgttcact gttatgatta tcttcaccat tcttgtgcta 
ggagtggtgg caaatgaggg gctaggaaaa ccaaaaaaac aatgtaatga gattctaaag 
caatccaact gtgttgctgc agagtgtgac tctatgtgtg tgaagaagag ggggaaagga 
gccggttact gctctccttc taaaaagtgt tactgctatt atcattgccc ttaacgattt 
tattatttac aataataaga gatttatgat aaagtagtaa aataatgaat cgggtgttta 
tagcctaacg attgatatag ttgtctctat caaataaact atatagacta aaagtaataa 
taataatgac aatcataact cagaagtttt tgtttagtc 
(2} INFORMATION FOR SEQ ID NO: 505: 
•L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 amino acids 

(E) TYPE: amino acid 

(C) S T RAN D E DN ESS : 

( D } TOPOLOGY: linear 
iii). MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 817 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056246 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

Met Glu Gin Glu Lys Ser Leu Asp Pro Gin Leu Trp His Ala Cys Ala 

1 5 10 15 

Gly Ser Met Val Gin lie Pro Ser Leu Asn Ser Thr Val Phe Tyr Phe 

20 25 30 

Ala Gin Gly His Thr Glu His Ala His Ala Pro Pro Asp Phe His Ala 

35 40 45 

Pro A.rg Val Pro Pro Leu lie Leu Cys Arg Val Val Ser Val Lys Phe 

5 0 5 5 60 

Leu Ala Asp Ala Glu Thr Asp Glu Val Phe Ala Lys lie Thr Leu Leu 
65 70 75 80 

Pro Leu Pro Gly Asn Asp Leu Asp Leu Glu Asn Asp Ala Val Leu Gly 

85 90 95 

Leu Thr Pro Pro Ser Ser Asp Gly Asn Gly Asn Gly Lys Glu Lys Pro 

100 " 105 110 

Ala Ser Phe Ala Lys Thr Leu Thr Gin Ser Asp Ala Asn Asn Gly Gly 

115 120 125 

Gly Phe Ser Val Pro Arg Tyr Cys Ala Glu Thr lie Phe Pro Arg Leu 

130 135 140 

Asp Tyr Ser Ala Glu Pro Pro Val Gin Thr Val lie Ala Lys Asp lie 
145 150 155 160 

His Gly Glu Thr Trp Lys Phe Arg His lie Tyr Arg Gly Thr Pro Arg 

165 170 175 

Arg His Leu Leu Thr Thr Gly Trp Ser Thr Phe Val Asn Gin Lys Lys 

180 ^ 185 190 

Leu lie Ala Gly Asp Ser lie Val Phe Leu Arg Ser Glu Ser Gly Asp 

195 200 205 

Leu Cys Val Gly lie Arg Arg Ala Lys Arg Gly Gly Leu Gly Ser Asn 

210 215 ~ 220 

Ala Gly Ser Asp Asn Pro Tyr Pro Gly Phe Ser Gly Phe Leu Arg Asp 
225 230 235 240 

Asp Glu Ser Thr Thr Thr Thr Ser Lys Leu Met Met Met Lys Arg Asn 

245 ^ 250 255 

Gly Asn Asn Asp Gly Asn Ala Ala Ala Thr Gly Arg Val Arg Val Glu 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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260 265 270 

Ala Val Ala Glu Ala Val Ala Arg Ala Ala Cys Gly Gin Ala Phe Glu 

275 280 285 

Val Val Tyr Tyr Pro Arg Ala Ser Thr Pro Glu Phe Cys Val Lys Ala 
5 290 295 300 

Ala Asp Val Arg Ser Ala Met Arg lie Arg Trp Cys Ser Gly Met Arg 
305 310 315 320 

Phe Lys Met Ala Phe Glu Thr Glu Asp Ser Ser Arg lie Ser Trp Phe 
325 330 335 

10 Met Gly Thr Val Ser Ala Val Gin Val Ala Asp Pro lie Arg Trp Pro 

340 345 350 

Asn Ser Pro Trp Arg Leu Leu Gin Val Ala Trp Asp Glu Pro Asp Leu 

355 360- ' 365 

Leu Gin Asn Val Lys Arg Val Ser Pro Trp Leu Val Glu Leu Val Ser 
15 370 375 380 

Asn Met Pro Thr lie His Leu Ser Pro Phe Ser Pro Arg Lys Lys lie 
385 390 395 400 

Arg lie Pro Gin Pro Phe Glu Phe Pro Phe His Gly Thr Lys Phe Pro 
405 410 415 

20 lie Phe Ser Pro Gly Phe Ala Asn Asn Gly Gly Gly Glu Ser Met Cys 

420 425 430 

Tyr Leu Ser Asn Asp Asn Asn Asn Ala Pro Ala Gly lie Gin Gly Ala 

435 440 445 

Arg Gin Ala Gin Gin Leu Phe Gly Ser Pro Ser Pro Ser Leu Leu Ser 
25 450 455 460 

Asp Leu Asn Leu Ser Ser Tyr Thr Gly Asn Asn Lys Leu His Ser Pro 
465 470 475 480 

Ala Met Phe Leu Ser Ser Phe Asn Pro Arg His His His Tyr Gin Ala 
485 490 495 

30 Arg Asp Ser Glu Asn Ser Asn Asn lie Ser Cys Ser Leu Thr Met Gly 

500 505 " 510 

Asn Pro Ala Met Val Gin Asp Lys Lys Lys Ser Val Gly Ser Val Lys 

515 " 520 525 

Thr His Gin Phe Val Leu Phe Gly Gin Pro lie Leu Thr Glu Gin Gin 
35 530 535 540 

Val Met Asn Arg Lys Arg Phe Leu Glu Glu Glu Ala Glu Ala Glu Glu 
545 550 555 560 

Glu Lys Gly Leu Val Ala Arg Gly Leu Thr Trp Asn Tyr Ser Leu Gin 
565 570 575 

40 Gly Leu Glu Thr Gly His Cys Lys Val Phe Met Glu Ser Glu Asp Val 

580 585 590 

Gly Arg Thr Leu Asp Leu Ser Val lie Gly Ser Tyr Gin Glu Leu Tyr 

595 600 605 

Arg Lys Leu Ala Glu Met Phe His lie Glu Glu Arg Ser Asp Leu Leu 
45 610 615 620 

Thr His Val Val Tyr Arg Asp Ala Asn Gly Val lie Lys Arg lie Gly 
625 630 635 640 

Asp Glu Pro Phe Ser Asp Phe Met Lys Ala Thr Lys Arg Leu Thr lie 
645 650 655 

50 Lys Met Asp lie Gly Gly Asp Asn Val Arg Asn lie Phe Ser Phe Leu 

660 665 670 

Gin lie lie Phe Phe Phe Phe Phe Phe Lys Lys Leu lie Phe Phe Leu 

675 680 685 

Phe Phe Ser Asp Gly Gly Tyr Tyr Thr His lie His lie Tyr Ser Ser 
55 690 695 700 

Asp Tyr lie Tyr lie Tyr lie Tyr lie Tyr lie Ser Asn lie Lys Lys 
705 710 715 720 

Lys Leu lie Leu Phe lie Leu Leu Ser Leu Lys Thr Val Asn Ser Phe 
725 730 735 

60 lie Ser Ser Met Met Lys Lys Leu lie Gin Leu Ser Phe Thr Val Met 

740 745 750 

lie lie Phe Thr lie Leu Val Leu Gly Val Val Ala Asn Glu Gly Leu 
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755 760 765 

Gly Lys Pro Lys Lys Gin Cys Asn Glu lie Leu Lys Gin Ser Asn Cys 

770 775 780 

Val Ala Ala Glu Cys Asp Ser Met Cys Val Lys Lys Arg Gly Lys Gly 
785 790 795 800 

Ala Gly Tyr Cys Ser Pro Ser Lys Lys Cys Tyr Cys Tyr Tyr His Cys 
805 810 815 

Pro 



{2} INFORMATION FOR SEQ ID NO: 50 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1..7 99 

I'D) OTHER INFORMATION: / Ceres Seq. ID 2056247 
ixi; SEQUENCE DESCRIPTION: SEQ ID NO: 50 6: 



Met 


val 


Gin 


lie 


Pro 


Ser 


Leu 


Asn 


Ser 


Thr 


Val 


Phe 


T vr 


Phe 


Ala 


Gin 


1 








5 










10 










15 




Gly 


His 


Thr 


Glu 


His 


Ala 


His 


Ala 


Pro 


Pro 


Asp 


Phe 


His 


Ala 


Pro 


Arg 








20 










25 










30 






Val 


Pro 


Pro 


Leu 


He 


Leu 


Cys 


Arg 


Val 


Val 


Ser 


Val 


Lys 


Phe 


Leu 


Ala 






35 










40 










45 








Asp 


Ala 


Glu 


Thr 


Asp 


Glu 


Val 


Phe 


Ala 


Lys 


He 


Thr 


Leu 


Leu 


Pro 


Leu 




50 










55 










60 










Pro 


Gly 


Asn 


Asp 


Leu 


Asp 


Leu 


Glu 


Asn 


Asp 


Ala 


Val 


Leu 


Gly 


Leu 


Thr 


65 










70 










75 










80 


Pro 


Pro 


Ser 


Ser 


Asp 


Gly 


Asn 


Gly 


Asn 


Gly 


Lys 


Glu 


Lys 


Pro 


Ala 


Ser 










85 










90 










95 




Phe 


Ala 


Lys 


Thr 


Leu 


Thr 


Gin 


Ser 


Asp 


Ala 


Asn 


Asn 


Gly 


Gly 


Gly 


Phe 








100 










105 










110 






Ser 


Val 


Pro 


Arg 


Tyr 


Cys 


Ala 


Glu 


Thr 


He 


Phe 


Pro 


Arg 


Leu 


Asp 


Tyr 






115 










120 










125 








Ser 


Ala 


Glu 


Pro 


Pro 


Val 


Gin 


Thr 


Val 


He 


Ala 


Lys 


Asp 


He 


His 


Gly 




130 










135 










140 










Glu 


Thr 


Trp 


Lys 


Phe 


Arg 


His 


He 


Tyr 


Arg 


Gly 


Thr 


Pro 


Arg 


Arg 


His 


145 










150 










155 










160 


Leu 


Leu 


Thr 


Thr 


Gly 


Trp 


Ser 


Thr 


Phe 


Val 


Asn 


Gin 


Lys 


Lys 


Leu 


He 










165 










170 










175 




Ala 


Gly 


Asp 


Ser 


He 


Val 


Phe 


Leu 


Arg 


Ser 


Glu 


Ser 


Gly 


Asp 


Leu 


Cys 








180 










185 










190 






Val 


Gly 


He 


Arg 


Arg 


Ala 


Lys 


Arg 


Gly 


Gly 


Leu 


Gly 


Ser 


Asn 


Ala 


Gly 






195 










200 










205 








Ser 


Asp 


Asn 


Pro 


Tyr 


Pro 


Gly 


Phe 


Ser 


Gly 


Phe 


Leu 


Arg 


Asp 


Asp 


Glu 




210 










215 










220 










Ser 


Thr 


Thr 


Thr 


Thr 


Ser 


Lys 


Leu 


Met 


Met 


Met 


Lys 


Arg 


Asn 


Gly 


Asn 


225 










230 










235 










240 


Asn 


Asp 


Gly 


Asn 


Ala 


Ala 


Ala 


Thr 


Gly 


Arg 


Val 


Arg 


Val 


Glu 


Ala 


Val 










245 










250 










255 




Ala 


Glu 


Ala 


Val 


Ala 


Arg 


Ala 


Ala 


Cys 


Gly 


Gin 


Ala 


Phe 


Glu 


Val 


Val 








260 










265 










270 






Tyr 


Tyr 


Pro 


Arg 


Ala 


Ser 


Thr 


Pro 


Glu 


Phe 


Cys 


Val 


Lys 


Ala 


Ala 


Asp 






275 










280 










285 








Val 


Arg 


Ser 


Ala 


Met 


Arg 


He 


Arg 


Trp 


Cys 


Ser 


Gly 


Met 


Arg 


Phe 


Lys 




290 










295 










300 










Met 


Ala 


Phe 


Glu 


Thr 


Glu 


Asp 


Ser 


Ser 


Arg 


He 


Ser 


Trp 


Phe 


Met 


Gly 


305 










310 










315 










320 


Thr 


Val 


Ser 


Ala 


Val 


Gin 


Val 


Ala 


Asp 


Pro 


He 


Arg 


Trp 


Pro 


Asn 


Ser 
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325 330 335 

Pro Trp Arg Leu Leu Gin Val Ala Trp Asp Glu Pro Asp Leu Leu Gin 

34 0 345 350 

Asn Val Lys Arg Val Ser Pro Trp Leu Val Glu Leu Val Ser Asn Met 
5 355 360 365 

Pro Thr lie His Leu Ser Pro Phe Ser Pro Arg Lys Lys lie Arg lie 

370 375 380 

Pro Gin Pro Phe Glu Phe Pro Phe His Gly Thr Lys Phe Pro lie Phe 
385 390 395 400 

10 Ser Pro Gly Phe Ala Asn Asn Gly Gly Gly Glu Ser Met Cys Tyr Leu 

405 410 415 

Ser Asn Asp Asn Asn Asn Ala Pro Ala Gly lie Gin Gly Ala Arg Gin 

420 425 ~ 430 

Ala Gin Gin Leu Phe Gly Ser Pro Ser Pro Ser Leu Leu Ser Asp Leu 
15 435 440 445 

Asn Leu Ser Ser Tyr Thr Gly Asn Asn Lys Leu His Ser Pro Ala Met 

450 455 460 

Phe Leu Ser Ser Phe Asn Pro Arg His His His Tyr Gin Ala Arg Asp 
465 470 475 ~ • 480 

20 Ser Glu Asn Ser Asn Asn lie Ser Cys Ser Leu Thr Met Gly Asn Pro 

485 490 495 

Ala Met Val Gin Asp Lys Lys Lys Ser Val Gly Ser Val Lys Thr His 

500 505 510 

Gin Phe Val Leu Phe Gly Gin Pro lie Leu Thr Glu Gin Gin Val Met 
25 515 520 525 

Asn Arg Lys Arg Phe Leu Glu Glu Glu Ala Glu Ala Glu Glu Glu Lys 

530 535 540 

Gly Leu Val Ala Arg Gly Leu Thr Trp Asn Tyr Ser Leu Gin Gly Leu 
545 550 555 560 

30 Glu Thr Gly His Cys Lys Val Phe Met Glu Ser Glu Asp Val Gly Arg 

565 570 ~ 575 

Thr Leu Asp Leu Ser Val lie Gly Ser Tyr Gin Glu Leu Tyr Arg Lys 

580 585 590 

Leu Ala Glu Met Phe His lie Glu Glu Arg Ser Asp Leu Leu Thr His 
35 595 600 1 605 

Val Val Tyr Arg Asp Ala Asn Gly Val lie Lys Arg lie Gly Asp Glu 

610 615 620 

Pro Phe Ser Asp Phe Met Lys Ala Thr Lys Arg Leu Thr lie Lys Met 
625 630 " " 635 640 

40 Asp lie Gly Gly Asp Asn Val Arg Asn lie Phe Ser Phe Leu Gin lie 

645 650 655 

lie Phe Phe Phe Phe Phe Phe Lys Lys Leu lie Phe Phe Leu Phe Phe 

660 665 670 

Ser Asp Gly Gly Tyr Tyr Thr His lie His lie Tyr Ser Ser Asp Tyr 
45 675 680 685 

lie Tyr lie Tyr lie Tyr lie Tyr lie Ser Asn lie Lys Lys Lys Leu 

690 695 700 

lie Leu Phe lie Leu Leu Ser Leu Lys Thr Val Asn Ser Phe lie Ser 
705 710 715 720 

50 Ser Met Met Lys Lys Leu lie Gin Leu Ser Phe Thr Val Met lie lie 

725 730 735 

Phe Thr lie Leu Val Leu Gly Val Val Ala Asn Glu Gly Leu Gly Lys 

740 745 750 

Pro Lys Lys Gin Cys Asn Glu lie Leu Lys Gin Ser Asn Cys Val Ala 
55 755 760 765 

Ala Glu Cys Asp Ser Met Cys Val Lys Lys Arg Gly Lys Gly Ala Gly 

770 775 780 

Tyr Cys Ser Pro Ser Lys Lys Cys Tyr Cys Tyr Tyr His Cys Pro 
785 790 795 

60 (2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 amino acids 
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(B) TYPE: amino acid 

(C) ST HANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 5 67 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056248 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
10 Met Met Met Lys Arg Asn Gly Asn Asn Asp Gly Asn Ala Ala Ala Thr 
15 10 15 

Gly A.rg Val Arg Val Glu Ala Val Ala Glu Ala Val Ala Arg Ala Ala 

20 25 30 

Cys Gly Gin Ala Phe Glu Val Val Tyr Tyr Pro Arg Ala Ser Thr Pro 
15 35 40 45 

Glu Phe Cys Val Lys Ala Ala Asp Val Arg Ser Ala Met Arg lie Arg 

50 5 5 60 

Trp Cys Ser Gly Met Arg Phe Lys Met Ala Phe Glu Thr Glu Asp Ser 
6 5 7 0 7 5 .80 

2 0 Ser Arg lie Ser Trp Phe Met Gly Thr Val Ser Ala Val Gin Val Ala 

8 5 90 ?5 

Asp Pro lie Arg Trp Pro Asn Ser Pro Trp Arg Leu Leu Gin Val Ala 

100 105 ' " 110 

Tro Asp Glu Pro Asp Leu Leu Gin Asn Val Lys Arg Val Ser Pro Trp 
25 115 120 125 

Leu Val Glu Leu Val Ser Asn Met Pro Thr lie His Leu Ser Pro Phe 

130 135 140 

Ser Pro Arg Lys Lys lie A.rg lie Pro Gin Pro Phe Glu Phe Pro Phe 
145 150 155 160 

30 His Gly Thr Lys Phe Pro lie Phe Ser Pro Gly Phe Ala Asn Asn Gly 

165 170 175 

Gly Gly Glu Ser Met Cys Tyr Leu Ser Asn Asp Asn Asn Asn Ala Pro 

180 185 190 

Ala Gly lie Gin Gly Ala Arg Gin Ala Gin Gin Leu Phe Gly Ser Pro 
35 195 200 205 

Ser Pro Ser Leu Leu Ser Asp Leu Asn Leu Ser Ser Tyr Thr Gly Asn 

210 215 220 

Asn Lys Leu His Ser Pro Ala Met Phe Leu Ser Ser Phe Asn Pro Arg 
225 230 235 240 

40 His His His Tyr Gin Ala A.rg Asp Ser Glu Asn Ser Asn Asn lie Ser 

245 250 255 

Cys Ser Leu Thr Met Gly Asn Pro Ala Met Val Gin Asp Lys Lys Lys 

260 265 270 

Ser Val Gly Ser Val Lys Thr His Gin Phe Val Leu Phe Gly Gin Pro 
45 275 280 285 

lie Leu Thr Glu Gin Gin Val Met Asn Arg Lys A.rg Phe Leu Glu Glu 

290 295 ' 300 

Glu Ala Glu Ala Glu Glu Glu Lys Gly Leu Val Ala Arg Gly Leu Thr 
305 310 315 320 

50 Trp Asn Tyr Ser Leu Gin Gly Leu Glu Thr Gly His Cys Lys Val Phe 

325 " 330 ^ ^ 335 

Met Glu Ser Glu Asp Val Gly Arg Thr Leu Asp Leu Ser Val lie Gly 

340 345 350 

Ser Tyr Gin Glu Leu Tyr Arg Lys Leu Ala Glu Met Phe His lie Glu 
55 355 360 365 

Glu Arg Ser Asp Leu Leu Thr His Val Val Tyr Arg Asp Ala Asn Gly 

370 375 380 

Val lie Lys Arg lie Gly Asp Glu Pro Phe Ser Asp Phe Met Lys Ala 
385 390 395 400 

60 Thr Lys Arg Leu Thr lie Lys Met Asp lie Gly Gly Asp Asn Val Arg 

405 410 415 

Asn lie Phe Ser Phe Leu Gin lie lie Phe Phe Phe Phe Phe Phe Lys 
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420 425 430 

Lys Leu lie Phe Phe Leu Phe Phe Ser Asp Gly Gly Tyr Tyr Thr His 

4 35 4 40 4 45 

lie His lie Tyr Ser Ser Asp Tyr lie Tyr lie Tyr lie Tyr lie Tyr 
5 450 455 " 460 

lie Ser Asn lie Lys Lys Lys Leu lie Leu Phe lie Leu Leu Ser Leu 
465 470 475 480 

Lys Thr Val Asn Ser Phe lie Ser Ser Met Met Lys Lys Leu lie Gin 
485 490 495 

10 Leu Ser Phe Thr Val Met lie lie Phe Thr lie Leu Val Leu Gly Val 

500 505 510 

Val Ala Asn Glu Gly Leu Gly Lys Pro Lys Lys Gin Cys Asn Glu lie 

515 520 525 

Leu Lys Gin Ser Asn Cys Val Ala Ala Glu Cys Asp Ser Met Cys Val 
15 530 535 540 

Lys Lys Arg Gly Lys Gly Ala Gly Tyr Cys Ser Pro Ser Lys Lys Cys 
545 550 555 560 

Tyr Cys Tyr Tyr His Cys Pro 
565 

2 0 ;2) INFORMATION FOR SEQ ID NO: 508: 

!i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 776 

30 (D) OTHER INFORMATION: / Ceres Seq. ID 1941142 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 
agtcgcctcc ttcctgcgcc gccgccgccg ccgccgccgc cgctaccacc gcctcgcggc 60 
cgcctgcggc tccgcgtttc cgcacggaag ttggtgaagg tttcttgagg ctgagaacat 120 
ggtgctgaag acggaacttt gccgcttcag cggccagaag atttatcctg ggaaaggcat 180 

35 tagatttatc cgtgctgatt ctcaggtctt cctttttgcc aactcgaaat gcaagcgcta 240 
cttccacaac cgcctgaagc ctgcaaagct tacctggaca gcaatgtaca ggaagcagca 300 
caagaaggat atccatgctg aagcggtaaa gaagaggcgc cgcgccacca agaagccata 360 
ctccaggtca attgtgggtg cttccttgga agtaatccag aagaagagag ctgagaagcc 420 
agaggtccgc gatgctgcta gagaggctgc tcttcgtgag atcaaggagc gcatcaagaa 480 

4 0 gaccaaggat gagaagaaag cgaagaaggc ggaggtgagc aagtcccaga agacgcagac 540 
aaagggtgcg gtccagaagg gttccaaggg ccccaagttg ggcggcggtg gtgggaagcg 600 
ctgaaagaac ttagtgtcgt ttctcgacat tgcagtcgtt ccttagccaa agccactttc 660 
gtagaacctg tgttgaattt gcaagactta ttcaagcgtt gcttgtgcgt gctaaatacc 720 
atggcaagag aacggattta tatttatgcc tgaaaaaaat gaccgttcat attctt 

4 5 (2) INFORMATION FOR SEQ I D NO : 5 0 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..161 

55 (D) OTHER INFORMATION: / Ceres Seq. ID 1941143 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 
Met Val Leu Lys Thr Glu Leu Cys Arg Phe Ser Gly Gin Lys lie Tyr 
15 10 15 

Pro Gly Lys Gly lie Arg Phe lie Arg Ala Asp Ser Gin Val Phe Leu 
60 20 25 30 

Phe Ala Asn Ser Lys Cys Lys Arg Tyr Phe His Asn Arg Leu Lys Pro 
35 4 0 4 5 
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Ala Lys Leu Thr Trp Thr Ala Met Tyr Arg Lys Gin His Lys Lys Asp 

50 55 60 

lie His Ala Glu Ala Val Lys Lys Arg Arg Arg Ala Thr Lys Lys Pro 
65 70 75 80 

5 Tyr Ser Arg Ser lie Val Gly Ala Ser Leu Glu Val lie Gin Lys Lys 

85 90 95 

Arg Ala Glu Lys Pro Glu Val Arg Asp Ala Ala Arg Glu Ala Ala Leu 

100 105 110 

Arg Glu lie Lys Glu Arg lie Lvs Lys Thr Lys Asp Glu Lys Lys Ala 
10 115 120 125 

Lys Lys Ala Glu Val Ser Lys Ser Gin Lys Thr Gin Thr Lys Gly Ala 

130 135 140 

Val Gin Lys Gly Ser Lys Gly Pro Lys Leu Gly Gly Gly Gly Gly Lys 
145 150 155 160 

1 5 Arg 

(2) INFORMATION FOR SEQ ID NO: 510: 
(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 106 amino acids 
20 -B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
;. ix) FEATURE: 

25 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 106 

{ D) OTHER INFORMATION: / Ceres Seq. ID 1941144 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:510: 
Met Tyr Arg Lys Gin His Lys Lys Asp lie His Ala Glu Ala Val Lys 
30 1 5 10 15 

Lys Arg Arg Arg Ala Thr Lys Lys Pro Tyr Ser Arg Ser lie Val Gly 

20 25 30 

Ala Ser Leu Glu Val lie Gin Lys Lys Arg Ala Glu Lys Pro Glu Val 
35 40 45 

35 Arg Asp Ala Ala Arg Glu Ala Ala Leu Arg Glu lie Lys Glu Arg lie 
50 55 ~ 60 

Lys Lys Thr Lys Asp Glu Lys Lys Ala Lys Lys Ala Glu Val Ser Lys 
65 70 75 80 

Ser Gin Lys Thr Gin Thr Lys Gly Ala Val Gin Lys Gly Ser Lys Gly 

4 0 8 5 90 95 

Pro Lys Leu Gly Gly Gly Gly Gly Lys Arg 

100 105 
(2) INFORMATION FOR SEQ ID NO: 511: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1024 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

5 0 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..1024 

(D) OTHER INFORMATION: / Ceres Seq. ID 1942975 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 

55 ataaattcct ccgctcccct tcgccgcttc ctgtcgcgcc gccgctcccc tcccaaccca 60 
ctagctcaaa accctagagc tctcgcctgc tcgcctccgc cgccggagcg agctccccat 120 
cttcatccgt caaccatggc gacccagatc agcaagaaga agaagttcgt cagcgacggt 180 
gttttctacg ccgagctcaa tgagatgctg acgcgggagc tggcggagga cggctactct 240 
ggcgtggagg tgcgcgtcac gccgatgcgc acggagatca tcatccgcgc cacgcgcacg 300 

60 cagaacgtgc tcggcgagaa gggccgtagg atcagggagc tcacctccgt cgtccagaag 360 
aggttcaact tccctgagaa tggcgttgag ctttatgccg agaaggtcgt caaccgtggg 420 
ctctgtgcca tcgcgcaggc cgagtccctc cgttacaagc tcctcggtgg ccttgccgtc 480 
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cgcagggctt gctacggtgt tcttcgttat gttatggaga gtggtgccaa gggttgtgag 540 

gtgatcgtga gtggaaagct cagggcccaa agagccaagt ccatgaagtt caaggatggc 600 

tacatgatct catctggtca accagtgaac gagtacattg actcggctgt gagacacgtt 660 

cttctcagac agggtgttct tggcatcaag gtgaagatca tgcttgactg ggacccgaag 720 

ggcaaggttg gcccgatcac tcctcttccg gacctggtga ccatccacac cccgaaggac 780 

ggtcttggca cctcctgagg tgtaaggtcc caactgtgtc 840 

gggtcctaga taggtttcta tcttacgata aggacaatgc 900 

atctttatgg acagatgcta tgtgttgctt ctgtggcact 960 

cqqa.ga.caag atacttatat atgttcccga tgaaattatg 1020 



cgcgccctcc 
cagtttgatc 
ggttacctta 
ctaagttttg 



gaggacgagc 
cttgaaatgg 
taaggcagtt 
tttgtttaag 
gttc 

(2) INFORMATION FOR SEQ ID NO: 512: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION : I . .27 4 

{ D } OTHER INFORMATION: 



/ Ceres Sea. ID 1942976 



:xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 





He 


Asn 


Ser 


Ser 


Ala 


Pro 


Leu 


Arg 


Arg 


Phe 


Leu 


Ser 


arg 


Arg 


Arg 


Ser 




1 








5 










10 










15" 




25 


Pro 


Pro 


Asn 


Pro 


Leu 


Ala 


Gin 


Asn 


Pro 


Arg 


Ala 


Leu 


Ala 


Cys 


Ser 


Pro 










20 










25 










30 








Pro 


Pro 


Pro 


Glu 


Arg 


Ala 


Pro 


His 


Leu 


His 


Pro 


Ser 


Thr 


Met 


Ala 


Thr 








35 










40 










45 










Gin 


He 


Ser 


Lys 


Lys 


Lys 


Lys 


Phe 


Val 


Ser 


Asp 


Gly 


Val 


Phe 


Tyr 


Ala 


30 




50 










55 










60 












Glu 


Leu 


Asn 


Glu 


Met 


Leu 


Thr 


Arg 


Glu 


Leu 


Ala 


Glu 


Asp 


Gly 


Tyr 


Ser 




65 










70 










75 










80 




Gly 


Val 


Glu 


Val 


Arg 


Val 


Thr 


Pro 


Met 


Arg 


Thr 


Glu 


He 


He 


He 


Arg 












85 










90 










95 




35 


Ala 


Thr 


Arg 


Thr 


Gin 


Asn 


Val 


Leu 


Gly 


Glu 


Lys 


Gly 


Arg 


Arg 


He 


Arg 










100 










105 










110 








Glu 


Leu 


Thr 


Ser 


Val 


Val 


Gin 


Lys 


Arg 


Phe 


Asn 


Phe 


Pro 


Glu 


Asn 


Gly 








115 










120 










125 










Val 


Glu 


Leu 


Tyr 


Ala 


Glu 


Lys 


Val 


Val 


Asn 


Arg 


Gly 


Leu 


Cys 


Ala 


lie 


40 




130 










135 










140 












Ala 


Gin 


Ala 


Glu 


Ser 


Leu 


Arg 


Tyr 


Lys 


Leu 


Leu 


Gly 


Gly 


Leu 


Ala 


Val 




145 










150 










155 










160 




Arg 


Arg 


Ala 


Cys 


Tyr 


Gly 


Val 


Leu 


Arg 


Tyr 


Val 


Met 


Glu 


Ser 


Gly 


Ala 












165 










170 










175 




45 


Lys 


Gly 


Cys 


Glu 


Val 


He 


Val 


Ser 


Gly 


Lys 


Leu 


Arg 


Ala 


Gin 


Arg 


Ala 










180 










185 










190 








Lys 


Ser 


Met 


Lys 


Phe 


Lys 


Asp 


Gly 


Tyr 


Met 


He 


Ser 


Ser 


Gly 


Gin 


Pro 








195 










200 










205 










Val 


Asn 


Glu 


Tyr 


He 


Asp 


Ser 


Ala 


Val 


Arg 


His 


Val 


Leu 


Leu 


Arg 


Gin 


50 




210 










215 










220 












Gly 


Val 


Leu 


Gly 


He 


Lys 


Val 


Lys 


He 


Met 


Leu 


Asp 


Trp 


Asp 


Pro 


Lys 




225 










230 










235 










240 




Gly 


Lys 


Val 


Gly 


Pro 


He 


Thr 


Pro 


Leu 


Pro 


Asp 


Leu 


Val 


Thr 


He 


His 












245 










250 










255 




55 


Thr 


Pro 


Lys 


Asp 


Glu 


Asp 


Glu 


Pro 


Arg 


Pro 


Pro 


Val 


Leu 


Ala 


Pro 


Pro 










260 










265 










270 








Glu 


Val 
































(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO : 5 1 3 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .22 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1942977 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 
Met Ala Thr Gin lie Ser Lys Lys Lys Lys Phe Val Ser Asp Gly Val 
1 5 10 15 

Phe Tyr Ala Glu Leu Asn Glu Met Leu Thr Arg Glu Leu Ala Glu Asp 

20 25 30 

Gly Tyr Ser Gly Val Glu Val Arg Val Thr Pro Met Arg Thr Glu lie 

35 40 45 

lie lie Arg Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg 

5 0 55 60 

Arg lie Arg Glu Leu Thr Ser Val Val Gin Lys Arg Phe Asn Phe Pro 
65 70 75 80 

Glu Asn Gly Val Glu Leu Tyr Ala Glu Lys Val Val Asn Arg Gly Leu 

85 90 95* 

Z y s Ala lie .Ala Gin Ala Glu Ser Leu Arg Tyr Lys Leu Leu Gly Gly 

100 105 110 

Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu Arg Tyr Val Met Glu 

115 120 125 

Ser Gly Ala Lys Gly Cys Glu Val lie Val Ser Gly Lys Leu Arg Ala 

120 135 140 

Gin Arg Ala Lys Ser Met Lys Phe Lys Asp Gly Tyr Met lie Ser Ser 
145 150 155 160 

Gly Gin Pro Val Asn Glu Tyr lie Asp Ser Ala Val Arg Kis Val Leu 

165 170 175 

Leu Arg Gin Gly Val Leu Gly lie Lys Val Lys lie Met Leu Asp Trp 

180 185 190 

Asp Pro Lys Gly Lys Val Gly Pro lie Thr Pro Leu Pro Asp Leu Val 

195 ' 200 205 

Thr lie His Thr Pro Lys Asp Glu Asp Glu Pro Arg Pro Pre Val Leu 

210 215 220 

Ala Pro Pro Glu Val 
225 

i2) INFORMATION FOR SEQ ID NO: 514: 
■i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .206 

(D) OTHER INFORMATION: / Ceres Seq. ID 1942978 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
Met Leu Thr Arg Glu Leu Ala Glu Asp Gly Tyr Ser Gly Val Glu Val 
1 5 10 15 

Arg Val Thr Pro Met Arg Thr Glu lie lie lie A.rg Ala Thr Arg Thr 

20 25 30 

Gin Asn Val Leu Gly Glu Lys Gly Arg Arg lie Arg Glu Leu Thr Ser 

35 40 " 45 

Val Val Gin Lys Arg Phe Asn Phe Pro Glu Asn Gly Val Glu Leu Tyr 

50 55 60 

Ala Glu Lys Val Val Asn Aro Glv Leu Cvs Ala lie Ala Gin Ala Glu 
65 " 70 " 75 80 

Ser Leu Arg Tyr Lys Leu Leu Gly Gly Leu Ala Val Arg Arg Ala Cys 
8 5 90 9 5 
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Tyr Gly Val Leu Arg Tyr Val Met Glu Ser Gly Ala Lys Gly Cys Glu 

100 105 110 

Val lie Val Ser Gly Lys Leu Arg Ala Gin Arg Ala Lys Ser Met Lys 
115 120 125 

5 Phe Lys Asp Gly Tyr Met lie Ser Ser Gly Gin Pro Val Asn Glu Tyr 
130 135 140 

lie Asp Ser Ala Val Arg His Val Leu Leu Arg Gin Gly Val Leu Gly 
145 150 155 160 

lie Lys Val Lys lie Met Leu Asp Trp Asp Pro Lys Gly Lys Val Gly 
10 "* 165 170 175 

Pro lie Thr Pro Leu Pro Asp Leu Val Thr lie His Thr Pro Lys Asp 

180 185 190 

Glu Asp Glu Pro Arg Pro Pro Val Leu Ala Pro Pro Glu Val 
195 200 205 

15 (2) INFORMATION FOR SEQ ID NO: 515: 

Ji) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
2 0 (D) TOPOLOGY: linear 

vii- MOLECULE TYPE: DNA (genomic) 
lix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .10 69 

25 (D) OTHER INFORMATION: / Ceres Seq. ID 1944349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 
acttcgnttc tgcctttgac cagctcttgc gctcggctta tctcgtgcgg ttgttgctag 60 
aaccgtccct aattccattc ccgtgatctg gtttagtcat ttcatggttt tatttgttgc 120 
atctgtggtg aatctgtggt cacttagatc aacataggaa gatgtacgcc cagttgagtt 180 

30 ttgagtcggc tgttggtgtt tccctaacta gggactgagg gtctgaagct ctatacctgt 240 
tgcaatcgcg tagttgtgcg tgctagatgt atcagagaat ccctttgagt agaaagcaca 300 
cgcacacagc tgccacggta ttggcattcg taggtatctg tttcactgac gtgctacctg 360 
cacgaggata gcacctgata tttgtgcgct ttttttggtg aagatttttt gaagctgaga 420 
agatggtgct gaagacggaa ctctgccgct tcagtggcca gaagatctat cctgggaaag 480 

35 gcatccggtt tattcgtgct gattctcagg ttttcctttt tgccaactct aaatgcaagc 540 
gctacttcca caaccgcctg aagcctgcaa agcttacctg gacagcaatg tacaggaagc 600 
agcacaagaa ggacatccat gctgaagctg tcaagaagag gcgccgcgcc accaagaagc 660 
catacrcgag gtccattgtg ggtgcttcct tggaagtgat ccagaagaag agagctgaga 720 
agcctgaggt ccgtgatgct gctagagaag ctgctcttcg tgagatcaag gagcgcatca 780 

4 0 agaagaccaa ggacgagaag aaggcgaaga aggctgaggt gtccaagtcc cagaaggcgc 840 
agacgaaggg cgctgtccag aagggttcca agggccccaa gttgggcggc ggcggtggca 900 
aacgctgaaa aggaacgaag tgtcatttcg agacctcacc aaagtcacct ttgtagtacc 960 
tgggttaatt ttgcattttc gttcagagtt acttgtgact cttaactacc atgtcaagag 1020 
aatggattcg ttatttttat atattgggag aaatgatcgc cgttccttc 

4 5 (2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .161 

55 (D) OTHER INFORMATION: / Ceres Seq. ID 1944350 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:516: 
Met Val Leu Lys Thr Glu Leu Cys Arg Phe Ser Gly Gin Lys lie Tyr 
15 10 15 

Pro Glv Lys Gly lie Arg Phe lie Arg Ala Asp Ser Gin Val Phe Leu 
60 20 25 30 

Phe Ala Asn Ser Lys Cys Lys Arg Tyr Phe His Asn Arg Leu Lys Pro 
35 40 45 
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Ala 


Lys 


Leu 


Thr 


Trp 


Thr 


Ala 


Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 




3 U 










3 3 










D u 










lie 


his 


Ala 


Gl U 


Ala 


Val 


Lys 


Lys 


Arg 


Arg 


Arg 


Ala 


1 nr 


Lys 


Lys 


Pro 


D 3 










/ u 










/ 3 










Q Pi 

o u 


Tyr 


Ser 


Arg 


Ser 


lie 


Val 


ijly 


Ala 


Ser 


Leu 


Glu 


Val 


1 le 


Gin 


Lys 


Lys 










O 3 










Q Pi 










y 3 




Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 


Arg 


Asp 


Ala 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 








100 










105 










110 






Arg 


Glu 


He 


Lys 


Glu 


Arg 


He 


Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 






115 










120 










125 








Lys 


Lys 


Ala 


Glu 


Val 


Ser 


Lys 


Ser 


Gin 


Lys 


Ala 


Gin 


Thr 


Lys 


Gly 


Ala 




130 










135 










140 










Val 


Gin 


Lys 


Gly 


Ser 


Lys 


Gly 


Pro 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gly 


Lys 


145 










150 










155 










160 



1 5 Arg 



•2) INFORMATION FOR SEQ ID NO: 517: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 
2 0 i'B) TYPE: amine acid 

(C) STRAMDEDNESS : 

( D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
fix; FEATURE: 

2 5 (A) NAME /KEY: peptide 

(B) LOCATION: 1 . .10 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 194 4 351 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:517: 



Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 


He 


His 


Ala 


Glu 


Ala 


Val 


Lys 


1 








5 










10 










15 




Lys 


Arg 


Arg 


Arg 
20 


Ala 


Thr 


Lys 


Lys 


Pro 
25 


Tyr 


Ser 


Arg 


Ser 


He 
30 


Val 


Gly 


Ala 


Ser 


Leu 
35 


Glu 


Val 


He 


Gin 


Lys 
40 


L"s 


Arg 


Ala 


Glu 


Lys 
45 


Pro 


Glu 


Val 


Arg 


Asp 
50 


Ala 


Ala 


Arg 


Glu 


Ala 
55 


Ala 


Leu 


Arg 


Glu 


He 
60 


Lys 


Glu 


Arg 


He 


Lvs 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Ala 


Glu 


Val 


Ser 


Lvs 


65 










^0 










75 










80 


Ser 


Gin 


Lys 


Ala 


Gin 
85 


Thr 


Lys 


Gly 


Ala 


Val 

90 


Gin 


Lys 


Gly 


Ser 


Lys 
95 


Gly 


Pro 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gly 


Lys 


Arg 















100 105 
{2} INFORMATION FOR SEQ ID NO: 518: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

50 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 89 

(D) OTHER INFORMATION: / Ceres Seq. ID 1944352 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:518: 
55 Met Leu Leu Glu Lys Leu Leu Phe Val Arg Ser Arg Ser Ala Ser Arg 
1 5 1(T 15 

A.rg Pro Arg Thr Arg Arg Arg Arg Arg Arg Leu Arg Cys Pro Ser Pro 

20 2 5 30 

Arg Arg Arg Arg Arg Arg Ala Leu Ser Arg Arg Val Pro Arg Ala Pro 
60 35 4 0 " 4 5 

Ser Trp Ala Ala Ala Val Ala Asn Ala Glu Lys Glu Arg Ser Val He 
50 55 60 
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Ser Arg Pro His Gin Ser His Leu Cys Ser Thr Trp Val Asn Phe Ala 
65 70 75 80 

Phe Ser Phe Arg Val Thr Cys Asp Ser 
85 

5 (2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: I . . 1035 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 1964011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 9 : 
aacatataaa ttacgtgctc cctcttcgcc gccgtctctc gcgccgccac 
acccactcgc ccaaaaccgt agcgccctcg cctgctcgcc gccgccgtcg 
cccatcttcc tccgttaacc atggcgaccc agatcagcaa gaagaagaag 

20 acggcgtgtt ctacgccgag ctcaatgaga tgctgacgcg ggagctcgcg 
actccggcgt ggaggtgcgt gucacaccga tgcgcacgga gatcarcatc 
GCacgcagaa cgtgctcggc gagaagggcc gcagGatcag ggagctcacg 
agaagaggtt caacttcccc gagaacggcg tcgagctcta cgccgagaag 
gcggcctctg cgccatcgcg caggctgagt ccctccgcta caagctcctc 

25 ccgtccggag ggcctgctat ggtgttcttc gctttgttat ggagagtggt 
gtgaggtcat tgtgagtgga aagctcaggg crcaaagagc caagtccatg 
atggctacat gatctcatct ggtcagccag tgaatgagta catcgactca 
acgtcctcct cagacagggt gttctgggta tcaaggtgaa gatcatgctt 
cgaagggcaa ggttggcccg atcacccctc ttccggacct ggtgaccatc 

30 aggaggagga cgagctgcgc cctcctgtct tggcagctga ggtctaaggt 
tcctcgaaat ggcagtttga tcgggtccta gctaggtttg atcttatgat 
ctaagacagt tcgttccgtg tcacttctgt ttaaaattaa gttttgccga 
atgtaccgga tgaaagtgtg gttcc^tatt gtatgctctg gaattttatt 
ttattttttt gccct 

35 (2) INFORMATION FOR SEQ ID NO: 520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS: 
4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .27 4 
45 (D) OTHER INFORMATION: / Ceres Seq. ID 1964012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 



His 


He 


Asn 


Tyr 


Val 


Leu 


Pro 


Leu 


Arg 


Arg 


Arg 


Leu 


Ser 


Arg 


Arg 


His 


1 








5 










10 










15 "* 




Phe 


Pro 


Pro 


Asp 


Pro 


Leu 


Ala 


Gin 


Asn 


Arg 


Ser 


Ala 


Leu 


Ala 


Cys 


Ser 








20 










25 










30 






Pro 


Pro 


Pro 


Ser 


Leu 


Arg 


Ala 


Pro 


His 


Leu 


Pro 


Pro 


Leu 


Thr 


Met 


Ala 






35 










40 










45 








Thr 


Gin 


He 


Ser 


Lys 


Lys 


Lys 


Lys 


Phe 


Val 


Ser 


Asp 


Gly 


Val 


Phe 


Tyr 




50 










55 










60 










Ala 


Glu 


Leu 


Asn 


Glu 


Met 


Leu 


Thr 


Arg 


Glu 


Leu 


Ala 


Glu 


Asp 


Gly 


Tyr 


65 










70 










75 










80 


Ser 


Gly 


Val 


Glu 


Val 


Arg 


Val 


Thr 


Pro 


Met 


Arg 


Thr 


Glu 


He 


He 


He 










85 










90 










95 




Arg 


Ala 


Thr 


Arg 


Thr 


Gin 


Asn 


Val 


Leu 


Gly 


Glu 


Lys 


Gly 


Arg 


Arg 


He 








100 










105 










110 






Arg 


Glu 


Leu 


Thr 


Ser 


Val 


Val 


Gin 


Lys 


Arg 


Phe 


Asn 


Phe 


Pro 


Glu 


Asn 






115 










120 










125 









tt ccctcccg 


60 


ttgcgagctc 


120 


t t cgt cagcg 


180 


gaggacggct 


240 


cgcgcaacgc 


300 


rctqtggtcc 


360 


gtcgtcaacc 


420 


ggaggccttg 


480 


gccgagggtt 


540 


aagttcaagg 


600 


gctgtgagac 


660 


gactgggacc 


720 


cataccccga 


780 


cccaacagcg 


840 


aagggcagtg 


900 


gacaagctat 


960 


tttggtatgt 


1020 
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Gly Val Glu Leu Tyr Ala Glu Lys Val Val Asn Arg Gly Leu Cys Ala 

130 135 140 

lie Ala Gin Ala Glu Ser Leu Arg Tyr Lys Leu Leu Gly Gly Leu Ala 
145 150 155 160 

5 Val Arg Arg Ala Cys Tyr Gly Val Leu Arg Phe Val Met Glu Ser Gly 

165 170 175 

Ala Glu Gly Cys Glu Val lie Val Ser Gly Lys Leu Arg Ala Gin Arg 

180 18-5 ~ 190 

Ala Lys Ser Met Lys Phe Lys Asp Gly Tyr Met lie Ser Ser Gly Gin 
10 195 200 205 

Pro Val Asn Glu Tyr lie Asp Ser Ala Val Arg His Val Leu Leu Arg 

210 215 22 0 

Gin Gly Val Leu Gly lie Lys Val Lys lie Met Leu Asp Trp Asp Pro 
225 230 235 240 

15 Lys Gly Lys Val Gly Pro lie Thr Pro Leu Pro Asp Leu Val Thr lie 

245 250 255 

His Thr Pro Lys Glu Glu Asp Glu Leu Arg Pro Pro Val Leu Ala Ala 
260 265 270 

Glu Val 

20 

:2) INFORMATION FOR SEQ ID NO: 5 21: 
•i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 
(3) TYPE: amino acid 
2 5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
;ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
30 (B) LOCATION: 1..228 

(D) OTHER INFORMATION: / Ceres Seq. ID 1964013 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:521: 
Met Ala Thr Gin lie Ser Lys Lys Lys Lys Phe Val Ser Asp Gly Val 
1 10 15 

35 Phe Tyr Ala Glu Leu Asn Glu Met Leu Thr Arg Glu Leu Ala Glu Asp 

?0 25 30 

Gly Tyr Ser Gly Val Glu Val Arg Val Thr Pro Met Arg Thr Glu lie 

35 40~ 45 

He lie Arg Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg 
4 0 50 55 60~ 

Arg He Arg Glu Leu Thr Ser Val Val Gin Lys Arg Phe Asn Phe Pro 
65 70 75 80 

Glu Asn Gly Val Glu Leu Tyr Ala Glu Lys Val Val Asn Arg Gly Leu 
85 90 95 

4 5 Cys Ala He Ala Gin Ala Glu Ser Leu Arg Tyr Lys Leu Leu Gly Gly 

100 105 110 

Leu Ala Val Arg Arg Ala Cys Tyr Gly Val Leu Arg Phe Val Met Glu 

115 120 125 

Ser Gly Ala Glu Gly Cys Glu Val He Val Ser Gly Lys Leu Arg Ala 
50 130 135 140 

Gin Arg Ala Lys Ser Met Lys Phe Lys Asp Gly Tyr Met He Ser Ser 
145 150 155 " 160 

Gly Gin Pro Val Asn Glu Tyr He Asp Ser Ala Val Arg His Val Leu 
165 170 175 

55 Leu Arg Gin Gly Val Leu Gly He Lys Val Lys He Met Leu Asp Trp 

180 185 190 

Asp Pro Lys Gly Lys Val Gly Pro He Thr Pro Leu Pro Asp Leu Val 

195 200 205 

Thr He His Thr Pro Lys Glu Glu Asp Glu Leu Arg Pro Pro Val Leu 
60 210 215 220 

Ala Ala Glu Val 
225 
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(2) INFORMATION FOR SEQ ID NO: 522: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
10 (B) LOCATION: 1..205 

(D) OTHER INFORMATION: / Ceres Seq. ID 1964014 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 



15 



Met 


Leu 


i nr 


Arg 


'o _L U 


Leu 


Aj_ d 


r* 1 n 


Asp 


i y 


T yr 


Ser 


\jiy 


v ai 


C 1 1 T 

lol u 


v a i 


i 
i 


















i n 
r u 














Ar g 


v a i 


m u v~ 

i nr 


Pro 


Me t: 


Arg 


1 nr 


blu 


Ire 


lie 


lie 


Arg 


hi a 


i nr 


Arg 


rp l_ _ 

i nr 








z u 




















J5 U 






■^i±n 




vai 


Leu 


bi y 


Cj_lU 


T t r c 

JjVS 


Gly 


Arg 


Arg 


lie 


Arg 


LjIU 


Leu 


Tnr 


C Q T- 

ber 






J D 










4 U 


















v a l 


v a 1 


bin 


Lys 


Arg 


■ps i_ _ 

pne 


Asn 


Pne 


Pro 


Glu 


As n 


Gly 


val 


blU 


Leu 


Tyr 




5 0 










-J 0 










D U 










Ala 


Glu 


Lys 


Val 


Val 


Asn 


Arg 


Gly 


Leu 


Cys 


Ala 


He 


Ala 


Gin 


Ala 


Glu 


65 










7 0 










7 5 










80 


Ser 


Leu 


Arg 


Tyr 


Lys 


Leu 


Leu 


Gly 


Gly 


Leu 


Ala 


Val 


Arg 


Arg 


Ala 


Cys 










85 










90 










95 




Tyr 


Gly 


Val 


Leu 


Arg 


Phe 


Val 


Met 


Glu 


Ser 


Gly 


Ala 


Glu 


Gly 


Cys 


Glu 








100 










105 










110 






Val 


He 


Val 


Ser 


Gly 


Lys 


Leu 


Arg 


Ala 


Gin 


Arg 


Ala 


Lys 


Ser 


Met 


Lys 






115 










120 










125 








Phe 


Lys 


Asp 


Gly 


Tyr 


Met 


He 


Ser 


Ser 


Gly 


Gin 


Pro 


Val 


Asn 


Glu 


Tyr 




130 










135 










140 










He 


Asp 


Ser 


Ala 


Val 


Arg 


His 


Val 


Leu 


Leu 


Arg 


Gin 


Gly 


Val 


Leu 


Gly 


145 










150 










155 










160 


He 


Lys 


Val 


Lys 


He 


Met 


Leu 


Asp 


Trp 


Asp 


Pro 


Lys 


Gl_ 


Lys 


Val 


Gly 










165 










170 










175 




Pro 


He 


Thr 


Pro 


Leu 


Pro 


Asp 


Leu 


Val 


Thr 


He 


His 


Thr 


Pro 


Lys 


Glu 








180 










185 










190 






Glu 


Asp 


Glu 


Leu 


Arg 


Pro 


Pro 


Val 


Leu 


Ala 


Ala 


Glu 


Val 












195 










200 










205 








(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 5. 


23: 

















20 



25 



30 



35 



4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 5 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .818 

(D) OTHER INFORMATION: / Ceres Seq. ID 1983854 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:523: 

gatataagaa accctagcgg agccactaac acagtcgcct ccttcctgcr ccgccgccgc 60 
cgccgccgcc gccgctacca ccgcctcgcg gccgcctgcg gctccgcgtt tccgcacgga 120 
agttggtgaa ggtttcttga ggctgagaac atggtgctga agacggaact ttgccgcttc 180 
agcgscagaa gatttatcct gggaaaggca ttagatttat ccgtgctgat tctcaggtct 240 

55 tcctttttgc caactcgaaa tgcaagcgct acttccacaa ccgcctgaag cctgcaaagc 300 
ttacctggac agcaatgtac aggaagcagc acaagaagga tatccatgct gaagcggtaa 360 
agaagaggcg ccgcgccacc aagaagccat actccaggtc aattgtgggt gcttccttgg 420 
aagtaatcca gaagaagaga gctgagaagc cagaggtccg cgatgctgct agagaagctg 480 
ctcttcgtga gatcaaggag cgcatcaaga agaccaagga tgagaagaaa gcgaagaagg 540 

60 cggaggtgag caagtcccag aagacgcaga caaagggtgc ggtccagaag ggttccaagg 600 
gccccaagtt gggcggcggt ggtgggaagc gctgaaagaa cttagtgtcg tttctcgaca 660 
ttgcagtcgt tccttagcca aagccacttt cgtagaacct gtgttgaatt tgcaagactt 720 
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10 



20 



25 



30 



35 



40 



45 



50 



55 



60 



600 

attcaagcgt tgcttgtgcg tgctaaatac catggcaaga gaacggattt atatttatgc 
ctgaaaaaaa tgaccgttca tattcttaac tcatcttg 
(2) INFORMATION FOR SEQ ID NO: 524: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
peptide 



780 



(ii) 
(ix) 



(xi ; 



MOLECULE TYPE: 
FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 106 

(D) OTHER INFORMATION: / 
SEQUENCE DESCRIPTION: SEQ 



Ceres 
ID NO 



Seq. ID 
524 : 



1983855 



Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 


He 


His 


Ala 


Glu 


Ala 


Val 


Lys 


1 








5 










10 










15 




Lys 


Arg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 


Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 








20~ 










25 










30 






Ala 


Ser 


Leu 


Glu 


7a 1 


lie 


Gin 


Lys 


Lys 


Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 
















40 










45 








Arg 


Asp 


~~ l a 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 


Ar a 


G 1 u 


He 


Lys 


Glu 


Arg 


lie 




5 0~ 










55 










60 










Lys 


Lys 


Thr 


Lys 


A.sp 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Ala 


Glu 


Val 


Ser 


Lys 


65 










7 0 










75 










80 


Ser 


Gin 


Lys 


Thr 


Gin 


Thr 


Lys 


Gly 


Ala 


Val 


Gin 


Lys 


Gly 


Ser 


Lvs 


Gly 










85 










90 










95 




Pro 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gl y 


Lys 


Arg 















(2) 



100 

INFORMATION 

(i) 



FOR 



105 
525 : 



(ID 
(ix) 



SEQ ID NO: 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: 
FEATURE : 

(A) NAME/KEY: 

(B) LOCATION : 



DNA (genomic) 



:D 1990261 



1 . .802 

(D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 
gatataagaa accctagcgg mscactaaya cagtcgcctc cttcctgcgc cgccgccgcc 
gccgccgcta ccaccgcctc gcggccgcct gcggctccgc gtttccgcac ggaagttggt 
gaaggtttct tgaggctgag aacatggtgc tgaagacgga actttgccgc ttcagcggcc 
agaagattta tcctgggaaa ggcattagat ttatccgtgc tgattctcag gtcttccttt 
ttgccaactc gaaatgcaag cgctacttcc acaaccgcct gaagcctgca aagcttacct 
ggacagcaat gtacaggaag cagcacaaga aggatatcca tgctgaagcg gtaaagaaga 
ggcgccgcgc caccaagaag ccatactcca ggtcaattgt gggtgcttcc ttggaagtaa 
tccagaagaa gagagctgag aagccagagg tccgcgatgc tgctagagaa gctgctcttc 
gtgagatcaa ggagcgcatc aagaagacca aggatgagaa gaaagcgaag aaggcggagg 
tgagcaagtc ccagaagacg cagacaaagg gtgcggtcca gaagggttcc aagggcccca 
agttgggcgg cggtggtggg aagcgctgaa agaacttagt gtcgtttctc gacattgcag 
tcgttcctta gccaaagcca ctttcgtaga acctgtgttg aatttgcaag acttattcaa 
gcgttgcttg tgcgtgctaa ataccatggc aagagaacgg atttatattt atgcctgaaa 
aaaaatgacc gttcatattc tt 
(2) INFORMATION FOR SEQ ID NO: 526: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 161 

(D) OTHER INFORMATION: / Ceres Seq. ID 2061972 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 



Met 


val 


Leu 


Lys 


inr 


bl U 


Leu 


Cys 


Arg 


rne 


ber 


Gly 


Gin 


Lys 


lie 


Tyr 


1 








O 










1 U 










1 0 




Pro 


(al y 


Lys 


gi y 


lie 


Arg 


rne 


Tl n 

lie 


Arg 


Ala 


Asp 


C -W- 

ber 


Liin 


val 


rne 


Leu 


















O C 
£ D 










J u 






Phe 


Ai a 


Asn 


Ser 


Lys 


Cys 


Lys 


Arg 


Tyr 


rne 


His 


Asn 


Arg 


Leu 


Lys 


Pro 






35 










40 










45 








Ala 


Lys 


Leu 


Thr 


Trp 


Thr 


Ala 


Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 




c; f j 










O D 










b U 










He 


His 


Ala 


Glu 


Ala 


Val 


Lys 


Lys 


A.rg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 


65 










70 










75 










80 


Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 


Ala 


Ser 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Lys 










85 










90 










95 




Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 


Arg 


Asp 


Ala 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 








100 










105 










110 






Arg 


Giu 


He 


Lys 


Glu 


Arg 


He 


Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 






115 










120 










125 








Lys 


Lys 

120 


A.I a 


Glu 


Val 


Ser 


Lys 
135 


Ser 


Gin 


Lys 


Thr 


Gin 
140 


Thr 


Lys 


Gly 


Ala 


Val 


Gin 


Lys 


Gly 


Ser 


Lys 


Gly 


Pre 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gly 


Lys 


145 










150 










155 










160 



Arg 

(2) INFORMATION FOR SEQ ID NO: 527: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .10 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 2061973 





(xi J 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:527 












Met 


Tyr 


Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 


He 


His 


Ala 


Glu 


Ala 


Val 


Lys 


-L 








5 










10 










15 




Lys 


A.rg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 


Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 








20 










25 










30 






Ala 


Ser 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Lys 


Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 






35 










40 










45 








A.rg 


Asp 


Ala 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 


Arg 


Glu 


He 


Lys 


Glu 


Arg 


He 




50 










55 










60 










Lvs 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Ala 


Glu 


Val 


Ser 


Lys 


65 










70 










75 










80 


Ser 


Gin 


Lys 


Thr 


Gin 


Thr 


Lys 


Gly 


Ala 


Val 


Gin 


Lys 


Gly 


Ser 


Lys 


Gly 










85 










90 










95 




Pro 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gly 


Lys 


Arg 















100 105 

SEQ TABLE 2 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
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(A) NAME /KEY : - 

(B) LOCATION: 1 . . 494 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007802 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
5 aacaatcgga agaaaggaga agacgaagat gtcgttggta tggttggaag cgatgttgcc 60 
tctcggaatc atcggtggga tgctctgtat catgggcaat tctcagtact acatccacaa 120 
agcttatcat ggccgtccta agcacatcgg ccacgatgaa tggga'tgttg ctatggaaag 180 
acgcgacaag aaagtcgtcg agaaagctgc agctccttcc tcatgatucg ctttatctct 240 
tttgtgttcc toaggggctt aaggtgacca cttgtggtga caaataaagt gcattccaga 300 
10 agaagaagaa gctgggggat ctagtacttt cattcccatt tgatrttcct tggacatatt 360 
aaagctttca gaaatcagac ctcaataaca tttggtttat caatatttct ctattcgtga 420 
tattttgtat gcttttaagt tgccitgatac actgaactac tctcttgctg tgtgagtgaa 480 
"aaatgaata tctc 

;2} INFORMATION FOR SEQ ID NO : 2 : 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 
;B) TYPE: amino acid 

(C) STRANDEDNESS : 

D) TOPOLOGY: linear 
2 0 ii) MOLECULE TYPE: peptide 

ix) FEATURE: 

■■' A ) NAME / KEY: pepci de 
:3) LOCATION: i . . 94 

:'D) OTHER INFORMATION: / Ceres Seq. ID 1007803 
2 5 ;xi; SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Gin Ser Glu Glu Arg Arg Arg Arg Arg Cys Arg Trp Tyr Gly Trp Lys 
I 5 10 15 

Arg Cys Cys Leu Ser Glu Ser Ser Val Gly Cys Ser Val Ser Trp Ala 
20 25 30 

30 He Leu Ser Thr Thr Ser Thr Lys Leu He Met Ala Val Leu Ser Thr 
3 5 4 0 4 5 

Ser Ala Thr Met Asn Gly Me 1 " Leu Leu Trp Lys Asp Ala Thr Arg Lys 

50 55 60 

Ser Ser Arg Lys Leu Gin Leu Leu Pro His Asp Ser Leu Tyr Leu Phe 
35 65 ' " 7 0 7 5 " 60 

Cys Val Pro Gin Gly Leu Lys Val Thr Thr Cys Gly Asp Lys 

85 90 
:2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
4 0 ;A) LENGTH: 74 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .74 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007804 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
50 Thr lie Gly Arg Lys Glu Lys Thr Lys Met Ser Leu val Trp Leu Glu 
15 10 15 

A.la Met Leu Pro Leu Gly lie lie Gly Gly Met Leu Cys He Met Gly 

20 25 30 

Asn Ser Gin Tyr Tyr He His Lys Ala Tyr His Gly Arg Pro Lys His 

5 5 35 4 0 45^ 

He Gly His Asp Glu Trp Asp Val Ala Met Glu Arg Arg Asp Lys Lys 

50 55 60 

Val Val Glu Lys Ala Ala Ala Pro Ser Ser 
65 70 
60 (2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 1007805 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
10 Met Ser Leu Val Trp Leu Glu Ala Met Leu Pro Leu Gly lie lie Gly 
15 10 15 

Gly Met Leu Cys lie Met Gly Asn Ser Gin Tyr Tyr lie His Lys Ala 

20 " 25 ~ 30 

Tyr His Gly Arg Pro Lys His lie Gly His Asp Glu Trp Asp Val Ala 
15 35 40 45 

Met Glu Arg Arg Asp Lys Lys Val Val Glu Lys Ala Ala Ala Pro Ser 
50 55 60 

Ser 
65 

2 0 (.2) INFORMATION FOR SEQ ID NO : 5 : 

■ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .700 

30 (D) OTHER INFORMATION: / Ceres Seq. ID 1008556 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
aaaagaaaac tgaagaaccc caaaaatccc aagtgaaaga aaacgttatc cacttcgaga 60 
gagtgacacg tcacgaagca aatccaaatt caaaactttt ccataaaaat caaatcactt 120 
cttcttatta cttccccaaa tcgaaatcag tcacttatta aaaccctaaa tcatgacgat 180 

35 cgcaccggca ttgcagacga cgttcgtgtc atcaaccaac ttcctgaaac attcttcttc 240 
gtggggatca tcatcaccaa acaatgtgat tcttcccaaa aacaagagat cttctacctc 300 
cgtagttgtc gccgccgtcg gtgatgtctc ttctgacgga acaatctact taatcggcgg 360 
agccatcgcc gttgcactcg tcggaactgc atttccgatc ctcttcaaac gcaaagacac 420 
gtgtccggaa tgtgatggag caggatttgt gaggaaagga ggagtgactc tgagagccaa 480 

4 0 cgccgcacgg aaggatcttc ctcagatcgt ttgtgctaat tgcaatggac tcggaaagct 540 
taaccagatt gataaatcat aagtcttctt catatatatg taaaaagctc catttctcat 600 
tcctcttctt atctacaata tacctctgta agagtagata gcttcgaatg atcaatggtt 660 
gcttggagat ggatggatat atatatacaa ttgctttttt 
(2) INFORMATION FOR SEQ ID NO : 6 : 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
50 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 129 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008557 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Thr lie Ala Pro Ala Leu Gin Thr Thr Phe Val Ser Ser Thr Asn 
15 10 15 

Phe Leu Lys His Ser Ser Ser Trp Gly Ser Ser Ser Pro Asn Asn Val 
20 25 30 

60 lie Leu Pro Lys Asn Lys Arg Ser Ser Thr Ser Val Val Val Ala Ala 
35 40 45 

Val Gly Asp Val Ser Ser Asp Gly Thr lie Tyr Leu lie Gly Gly Ala 
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He Ala Val Ala 
65 

Lys Asp Thr Cys 

5 

Gly Val Thr Leu 
100 

Val Cys Ala Asn 
115 

10 Ser 



55 

Leu Val Gly Thr 
70 

Pro Glu Cys Asp 
85 

Arg Ala Asn Ala 

Cys Asn Gly Leu 
120 



604 

60 

Ala Phe Pro He 
75 

Gly Ala Gly Phe 
90 

Ala Arg Lys Asp 
105 

Gly Lys Leu Asn 



Leu Phe Lys Arg 
80 

Val Arg Lys Gly 
95 

Leu Pro Gin He 
110 

Gin He Asp Lys 
125 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
2 0 (A) NAME /KEY : - 

^3} LOCATION: I . . 665 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008628 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
atgttactct tctcagatct cttgttcgtt ttgatgtaca atggcgtcct tggtagcagc 60 

25 tcctatctct ttctcaggtg actctcatgt caaagcacac cgaaacttca atgcgattcg 120 
caaqagctct acattgactg t^caaacaaa atcaaaccgc agtcacaaac tctcggtttc 180 
rgcaggttac cgtgggggaa guaagggtgg tggaagtagt gat tttgtta ccggttttct 240 
tctaggaagt gctgtgttcg gaactctggc ttatatcttt gctccacaga tccgaagatc 300 
agtgctgagc gagaatgaat atggtttcaa gaaaccggag cagccgatgt actatgacga 360 

30 aggcctagag gagagaagag agatattgaa tgagaaaatc ggccaactca attccgccat 420 
tgacaaggtt tcgtcgcgtc tgaaaggagg tcggagcggt agcagcaaga acacttcttc 480 
gccgtctgtc ccagttgaaa ccgacgcaga agcagaagct actgcatgat tgaatgtaat 540 
cctctgctct attttaccaa ttcaaaactg ccttccattg gttctgtggt tttttttttt 600 
tttttgttgg aaccattagg ggcttttctg acttttagat attgaaagaa aaagacaatc 660 

35 gtcgc 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 
40 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
45 (B) LOCATION: 1 . . 162 

(D) OTHER INFORMATION: / Ceres Seq. ID 1008629 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
Met Ala Ser Leu Val Ala Ala Pro lie Ser Phe Ser Gly Asp Ser His 
15 10 15 

50 Val Lys Ala His Arg Asn Phe Asn Ala lie Arg Lys Ser Ser Thr Leu 

20 25 30 

Thr Val Gin Thr Lys Ser Asn Arg Ser His Lys Leu Ser Val Ser Ala 

35 40 * 45 

Gly Tyr Arg Gly Gly Ser Lys Gly Gly Gly Ser Ser Asp Phe Val Thr 
55 50 ~ 55 " 60 

Gly Phe Leu Leu Gly Ser Ala Val Phe Gly Thr Leu Ala Tyr lie Phe 
65 70 75 80 

Ala Pro Gin lie Arg Arg Ser Val Leu Ser Glu Asn Glu Tyr Gly Phe 
85 90 * 95 

60 Lys Lys Pro Glu Gin Pro Met Tyr Tyr Asp Glu Gly Leu Glu Glu Arg 

100 105 110 

Arg Glu lie Leu Asn Glu Lys lie Gly Gin Leu Asn Ser Ala lie Asp 
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115 120 125 

Lys Val Ser Ser Arg Leu Lys Gly Gly Arg Ser Gly Ser Ser Lys Asn 

130 135 140 

Thr Ser Ser Pro Ser Val Pro Val Glu Thr Asp Ala Glu Ala Glu Ala 
145 150 155 160 

Thr Ala 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 9: 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
linear 

DNA (genomic) 



(D) TOPOLOGY: 
ILL) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



1 . .1101 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009376 
: 'xij SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
cttctttcct ctttcttctt tctgttttct ggtcgctcgt tacccccaag acctcgtcga 60 
ggtt:::ttc tcrxtgaaaa agtgcttttg crgcgrttcc atgctctcgg aaggaacttg 120 
gctgaaicgt tggaggatta tcgatctggc gtcgttggtg caictctagc tgcgccaatt 180 
catcggatcc ggtt tgatt t ctccctttcg cgaatctagc cttccgtgat cttagccatg 240 
ttctaiggca cagctgtatg ggacccttgg cttatcgttg gccagattat ttgcctccaa 300 
tgctcmact atctcactct tggactcttc accatggtct ttcttggcct tcgtgttcct 360 
cgccrtagtc ttgtctactt cctcgattac gctactctca ctacttccac cttcaccggt 420 
tggtcrgtt a ttgcctcatt cctcttctct tcactcgctg gggctgtgta catgatattt 480 
ttggtggagc gagcacggaa atgcttagat ttctctgcaa ctctctacat catacatctc 540 
ttcttrtgca tcatgtatgg aggatggcct tcctctatgg cgtggtgggt tgttaatgga 600 
accggactcg ctgttatggc tttgctagct gag^tacttgt gcattaaacg cgaacagcga 660 
gagatcccta tggatcgctt ccactcaagg gtttgaagat tggtgaaaga acaagtgatg 720 
aagattcttg tgggttagca gcgcgactcg tagagcctag tagagcctag tactcttaac 780 
aattgtgatg tacagagatg aaatcagagg agtagaatgg gcgaaaggaa gttcagctag 840 
ccaagcaaca atgttggaca ctggtgcacc aactctgact gcaattttta gtgacttcaa 900 
atactaagac tttggaatac gtttcaattt gttcttttgt atgaaaaaaa cattagactc 960 
gaataggtta ttgatttcat gttgcttctt ttttttgttt atactggatt ttctcttcta 1020 
tcctcractt ggcgcagctt tgtgtgtttt aaaaccagtg tctcattcat catactcttt 1080 
taccaccaat tgaatccttc c 
(2) INFORMATION FOR SEQ ID NO : 1 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



peptide 
1 . . 152 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009377 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Met Phe Tyr Gly Thr Ala Val Trp Asp Pro Trp Leu lie Val Gly Gin 
15 10 15 

lie lie Cys Leu Gin Cys Ser Tyr Tyr Leu Thr Leu Gly Leu Phe Thr 

20 25 30 

Met Val Phe Leu Gly Leu Arg Val Pro Arg Leu Ser Leu Val Tyr Phe 

35 4 0 4 5 

Phe Asp Tyr Ala Thr Leu Thr Thr Ser Thr Phe Thr Gly Trp Ser Val 

50 55 60 

lie Ala Ser Phe Leu Phe Ser Ser Leu Ala Gly Ala Val Tyr Met lie 
65 70 75" 80 

Phe Leu Val Glu Arg Ala Arg Lys Cys Leu Asp Phe Ser Ala Thr Leu 
85 90 95 
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Tyr lie lie His Leu Phe Phe Cys lie Met Tyr Gly Gly Trp Pro Ser 

100 105 110 

Ser Met Ala Trp Trp Val Val Asn Gly Thr Gly Leu Ala Val Met Ala 

115 120 125 

Leu Leu Ala Glu Tyr Leu Cys lie Lys Arg Glu Gin Arg Glu lie Pro 

130 135 140 

Met Asp Arg Phe His Ser Arg Val 
145 150 
(2) INFORMATION FOR SEQ ID NO : 1 1 : 
fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 
( B } TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 

ax) FEATURE: 

(A) NAME /KEY : peptide 

( B ) LOCATION: 1 . .12 0 

( D) OTHER INFORMATION: /' Ceres Seq. ID 10 0 9 373 
xi) SEQUENCE DESCRIPTION: SEQ ID MO : 1 1 : 



Met 


Y a 1 


Phe 


Leu 


Gly 


Leu 


Arg 


V a I 


Pro 


Arg 


Leu 


Ser 


Leu 


Val 


Tyr 




















10 










15 


Phe 


Asp 


Tyr 


Ala 


Thr 


Leu 


Thr 


Thr 


Ser 


Thr 


Phe 


Thr 


Gly 


Trp 


Ser 








20 










25 










30 




He 


Ala 


Ser 


Phe 


Leu 


Phe 


Ser 


Ser 


Leu 


Ala 


Gly 


Ala 


Val 


Tyr 


Met 






35 










4 0 










4 5 






Phe 


Leu 


Val 


Glu 


Arg 


Ala 


Arg 


Ly s 


C y s 


Leu 


Asp 


Phe 


Ser 


Ala 


Thr 




50 










55" 










60 








Tyr 


lie 


He 


His 


Leu 


Phe 


Phe 


Cys 


He 


Met 


Tyr 


Gly 


Gly 


Trp 


Pro 


65 










70 










75 










Ser 


Met 


Ala 


Trp 


Trp 


Val 


Val 


Asn 


Gly 


Thr 


Gly 


Leu 


Ala 


Val 


Met 










85 










90 










95 


Leu 


Leu 


Ala 


Glu 


Tyr 


Leu 


Cys 


He 


Lys 


Arg 


Glu 


Gin 


Arg 


Glu 


He 








100 










105 










110 




Met 


Asp 


Arg 


Phe 


His 


Ser 


Arg 


Val 




















115 










120 
















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 1 


2 : 















80 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 76 

(D) OTHER INFORMATION: / Ceres Seq. ID 1009379 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Met Ala Gin Leu Tyr Gly Thr Leu Gly Leu Ser Leu Ala Arg Leu Phe 
15 10 15 

Ala Ser Asn Ala Leu Thr He Ser Leu Leu Asp Ser Ser Pro Trp Ser 

20 25 ~ 30 

Phe Leu Ala Phe Val Phe Leu Ala Leu Val Leu Ser Thr Ser Ser He 

35 40 45 

Thr Leu Leu Ser Leu Leu Pro Pro Ser Pro Val Gly Leu Leu Leu Pro 

50 55 60~ 

His Ser Ser Ser Leu His Ser Leu Gly Leu Cys Thr 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 base pairs 

(B) TYPE: nucleic acid 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..398 

(D) OTHER INFORMATION: / Ceres 
SEQUENCE DESCRIPTION: SEQ ID NO: 



(xi) 
aaat caatag 
get catactc 
gtcgt ctgtt 
tgaggatcat 
gcgcttccaa 
tattgattgt 
gtaaaaattg 



cttatctagt 
tcacat cct c 
aagtcgacac 
tccgcggttg 
gaagct ct t g 
ttttacgtt a 
actatatata 



t cttgactgt 
tagttageca 
gtgtctt egg 
ttctaggccg 
aacaaagct g 
aaa t cgett t 
at taaaggt a 



Seq. 
13: 

tcaacgat ca 
caaagctaag 
ttttctttgg 
gctctttggc 
tt ggt aattt 
t ataaattgg 
catt atgc 



ID 1011128 



atggcaccat 
aatcaatctc 
ccttggaagg 
gatceggcta 
gtacaatgtt 
aaattgaagt 



ccgctgcaat 
tgt catcgcc 
cat tagacaa 
ctatcgagaa 
tggttgggtt 
actgtaaaat 



60 
120 
180 
240 
300 
360 



(2) INFORMATION FOR SEQ ID NO: 14 



Met 
1 

His 
Thr 

Asp 

lie 
65 

(2) 



(ix) 



(xi) 
Ala 

Lys 

Arg 

His 

50 

GIu 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME / KEY : peptide 

(B) LOCATION: 1 . . 7 8 

(D) OTHER INFORMATION: / 
SEQUENCE DESCRIPTION: SEQ 



Pro Ser 



Ala 

Val 

35 

Ser 



Lys 

20 

Phe 



Ala 
5 

Asn 



Ala Met Leu lie 



Gin Ser Leu 



Gly Phe Leu 



Ser 
25 
Pro 



Ceres 
ID NO 
Leu 
10 
Ser 



ID 1011129 



Seq. 
: 14 : 

Ser His Pro Leu 



Ser 



Pro Ser Ser 



Ala Val Val 



Lys Arg Phe 



INFORMATION 

(i) 



FOR 



Gin 

70 

SEQ 



Leu 

55 

Glu 



Trp 
40 

Gly Arg 



Ala Leu Glu 



Trp Lys Ala Leu 
4 5 

Leu Phe Gly Asp 
60 

Gin Ser Cys 
75 



Val 

30 

Asp 



Val 
15 

Lys Ser 
Asn Glu 



Pro Ala Thr 



Trp 



ID NO: 15 : 



Met 
1 

Ser 



(ii) 
(ix) 



(xi) 
Leu 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. .72 

(D) OTHER INFORMATION: / 
SEQUENCE DESCRIPTION: SEQ 
lie Leu Ser His Pro Leu Val 



Leu Ser 



Ser 

20 



5 

Pro 



Ser Ser Val 



Lys 
25 



Ceres Seq 
ID NO: 15 : 
Ser His Lys 
10 
Ser 



ID 1011130 



Ala Lys 



Thr Arg Val 



Phe 

30 



Asn Gin 
15 

Gly Phe 



Leu 


Trp Pro Trp 


Lys 


Ala 


Leu Asp Asn 


Glu 


Asp 


His 


Ser 


Ala 


Val 


Val 




35 






40 








45 








Leu 


Gly Arg Leu 


Phe 


Gly 


Asp Pro Ala 


Thr 


He 


Glu 


Lys 


Arg 


Phe 


Gin 




50 






55 






60 










Glu 


Ala Leu Glu 


Gin 


Ser 


Cys Trp 
















65 






70 


















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 16: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 54 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011718 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

10 agggatgtag taggccaata ggcaaatcag agaatcacaa atggtatctg gtcaagaaga 60 
ttcctgtttg tctgcagatg catgtgtgta atctagggta tatgtttttg tccatttggt 120 
ttcataaggc aataaag~tc cagctattta ctacttgtga agaaaaaagt gagggaaaac 180 
agagtcaagt ccgattactt ttctgggaaa gctgaggatt ttgtgttctt agagagcaga 240 
tgaagaggtg actgttgttt ctggtgagat agaagttggc atctaaattt gctttctctg 300 

15 gaccagtgaa atcaattgca gtctcctata tattgtagta ggcgatgtat cagtggtgaa 360 
gcgaggatga catctgtggg tacttgttct taatccttca ctctgatatc tatgctttag 420 
aagcgtttca agttcatgaa gctgatttga tgttgagttt ttaacaacaa gaatcaattc 480 
actccaaaaa tatgtaaact ctgccatgcc ttgagctgct ttttggatta tccactgttt 540 
tgttttgtg 

2 0 ■ ?.) INFORMATION FOR SEQ ID NO : 1 7 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: "0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

2 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 70 

30 (D) OTHER INFORMATION: / Ceres Seq. ID 1011719 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
Gly Cys Ser Arg Pro lie Gly Lys Se r Glu Asn His Lys Trp Tyr Leu 
1 ~ 5 10 " 15 

Val Lys Lys lie Pro Val Cys Leu Gin Met His Val Cys Asn Leu Gly 
35 20 25 30 

Tyr Met Phe Leu Ser lie Trp Phe His Lys Ala lie Lys lie Gin Leu 

35 40 45 

Phe Thr Thr Cys Glu Glu Lys Ser Glu Gly Lys Gin Ser Gin Val Arg 
50 55 60 

40 Leu Leu Phe Trp Glu Ser 
65 70 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

50 (A) NAME /KEY : peptide 

(B) LOCATION : 1 . . 45 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011720 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
Met His Val Cys Asn Leu Gly Tyr Met Phe Leu Ser lie Trp Phe His 
55 1 5 10 15 

Lys Ala lie Lys lie Gin Leu Phe Thr Thr Cys Glu Glu Lys Ser Glu 

20 25 30 

Gly Lys Gin Ser Gin Val Arg Leu Leu Phe Trp Glu Ser 
35 40 45 

60 (2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 37 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011721 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 : 
10 Met Phe Leu Ser lie Trp Phe His Lys Ala lie Lys lie Gin Leu Phe 
i 5 10 15 

Thr Thr Cys Glu Glu Lys Ser Glu Gly Lys Gin Ser Gin Val Arg Leu 

20 ~ 25 J 30 

Leu Phe Trp Glu Ser 
15 35 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

2 0 (C; STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
2 5 (B) LOCATION: 1 . . 4 17 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011735 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 
aattccttct ttttcacatc tacccaaaat tccaaaacac catcgatttt ttgctctctt 60 
gaaatgctct ttttatgtcc aattcttctc ttccaactaa accgaaccgg aaaaccaggt 120 
30 tcggagatcg gtgtttactg atggccaaac agcaacgaac ccgactttac atactccgaa 180 
gatgtgtctc catgctgctt tgctggcacg accactctat ttctgattag tttgcagtca 240 
tgaaccggac tgaccaagaa gtagcaatca gaagaatgtg atgtcattta gagttttttc 300 
ttgttttctt ttagaactct cttcagggaa tcttttgtaa ttgaagaaga ctctaaggat 360 
tgggcctttt gggcctttgt acatattgtg taaataacat atgaactttt tttaccc 
35 (2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION : 1 . . 51 

45 (D) OTHER INFORMATION: / Ceres Seq. ID 1011736 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Met Ser Asn Ser Ser Leu Pro Thr Lys Pro Asn Arg Lys Thr Arg Phe 
1 5 10 ^ 15 

Gly Asp Arg Cys Leu Leu Met Ala Lys Gin Gin Arg Thr Arg Leu Tyr 
50 20 25 30 

lie Leu Arg Arg Cys Val Ser Met Leu Leu Cys Trp His Asp His Ser 

35" 40 45 

lie Ser Asp 
50 

55 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(xi) 



(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 46 

(D) OTHER INFORMATION: / 
SEQUENCE DESCRIPTION: SEQ 



610 



Ceres Seq. 
ID NO:22: 



ID 1011737 



Asn 
1 

Phe 



Ser Phe Phe 



He 



Phe Thr Ser Thr Gin Asn Ser Lys Thr Pre Ser 
5 10 15 

Cys Ser Leu Glu Met Leu Phe Leu Cys Pro He Leu Leu Phe Gin 
20 25 " 30 

Leu Asn Arg Thr Gly Lys Pro Gly Ser Glu He Gly Val Tyr 
35 40 45 

INFORMATION FOR SEQ ID NO: 23: 
H) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY: linear 
peptide 



:ype- 



MOLECULE 
FEATURE: 

(A) NAME/KEY 
'." B '■■ LOCATION 



SEC 
Ser 

Leu 



peptide 
1 . .32 

OTHER INFORMATION: / Ceres Seq. 
EMCE DESCRIPTION : SEQ ID NO: 23: 



3 he Ser His Leu Pro Lys i: 



Pro 



rs His His 



Leu 
20 

INFORMATION 

i i } 



Lvs 



vs 



Ber Phe 



(ii) 
(ix) 



/r \ 
25 

FOR SEQ ID NO: 2 4 : 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



il Gin Phe Phe 



Ser 
30 



Arg 

15 

Ser 



Phe 



Asn 



DNA (genomic) 



1 . . 712 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011755 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
actcaaatag atgaagaaat ggegagctta agcacgagct tgcrcggaat aagacgaggg 
tcaagtaaca agaacaaggc agaagataag accaaaaacc agtgttttgc aagaagcatc 
caaagcatag acaatcacct ggtgtttgtt ctctttgtct taacgagagg ctctctcttt 
ttatcaaagc agcttcttca cgtagaccaa gatctegtea aatcttgtct acttcttcct 
caactacttc ctctctttca tetgaeggtt cttcttctgt ctcttcttgt ccttctccta 
ttgttgatcg ccgacgatac ttgttaatgt ccggaggcag eggtagagga gagaaagtga 
tttcgtggat gacgaaaagc egatcegtag cttataaagt ggatgatgag aaaagaagga 
agaagaagac aaagacgaat agtggtttct tttttggttt ggtaatgggc acaaagaaga 
gacaataaga tagtcagtct tgtagatttg ttagattaga trgttcttga ttgtgtattt 
tcttgattct ttcattttag attgattget acatatagat gactagatca gtttggtggg 
tacgtgcaag tgagagagat tctttatacg tattcttgtg attgattccc taagcttgta 
gtttagtgtg gtaacataaa atgtgataat ccagatttaa tatgegtrga ac 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) 
(ix) 



(xi) 



MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..108 

(D) OTHER INFORMATION: / 
SEQUENCE DESCRIPTION: SEQ 



Ceres Seq. 
ID NO: 25: 



ID 1011756 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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Thr Gin lie Asp Glu Glu Met Ala Ser Leu Ser Thr Ser Leu Leu Gly 

15 10 15 

lie Arg Arg Gly Ser Ser Asn Lys Asn Lys Ala Glu Asp Lys Thr Lys 

20 25 30 

Asn Gin Cys Phe Ala Arg Ser lie Gin Ser lie Asp Asn His Leu Val 

35 40 45 

Phe Val Leu Phe Val Leu Thr Arg Gly Ser Leu Phe Leu Ser Lys Gin 

50 55 60 

Leu Leu His Val Asp Gin Asp Leu Val Lys Ser Cys Leu Leu Leu Pro 
65 7 0 7 5 8 0 

Gin Leu Leu Pro Leu Phe His Leu Thr Val Leu Leu Leu Ser Leu Leu 

85 90 95 

Val Leu Leu Leu Leu Leu lie Ala Asp Asp Thr Cys 
100 105 

INFORMATION FOR SEQ ID NO : 2 6 : 



(2) 



SEQUENCE CHARACTERISTICS: 

0 LENGTH: 102 amino acids 
TYPE: amino acid 
STRANDEDNESS : 



( B ) 



■; ix; 



i D) TOPOLOGY: 
MOLECULE TYPE : 
FEATURE: 

(A) NAME/ KEY : 
{ E ) LOCATION: 



linear 
peptide 



Met 
1 



Ala 



peptiae 
1 . . 102 

{ D ) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO:26: 
Ser Leu Ser Thr Ser Leu Leu Gly lie Arg Arg Gly Ser Ser 



ID 1011757 



5 10 

Asn Lys Asn Lys Ala Glu Asp Lys Thr Lys Asn Gin Cys 

20 25 

Ser lie Gin Ser lie Asp Asn His Leu Val Phe Val 



35 

Thr Arg Gly Ser Leu Phe 
50 

Asp Leu Val Lys Ser Cys 
65 70 
His Leu Thr Val Leu Leu 
85 

lie Ala Asp Asp Thr Cys 
100 

INFORMATION FOR SEQ 
(i) 



Leu 

55 

Leu 



40 
Ser 



Lys Gin Leu 



Leu Leu Pro 



Leu Ser Leu 



Leu 

90 



Gin 

75 

Val 



Leu 

60 

Leu 



Leu 
45 
His 



Phe 

30 

Phe 



15 
Ala 



Arg 



Val Leu 



Val Asp Gin 



Leu Pro Leu 



Leu Leu 



Leu Leu 
95 



Phe 

80 

Leu 



(2) 



(ii) 
(ix) 



(xi) 



DNA (genomic) 



1 . .422 



ID NO: 27: 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011832 
SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
aataacaata tcaaaagata aaaacagagt ttgcttttat aagatcaaag aaggtaaaaa 
aaaaaagtgg caaaatggag tgcgatcgta ttgataatga tggtgatgat tgttgcggtc 
acaatagaag cacaagaaga aagtggttgg actatttgtt ttcgccaatg ttctcagcct 
tgtcgcagtg acgacggtag ttgctatgaa aactgtaaaa tcgaatgcgg tggtcccaag 
cctcccttat ctcgtctacg aagctcacat gcatgagatg gcatcatcaa tggaagttcg 
cggaaggata ggatgataaa aagagatttg tagcggattt ttgatcatct ctactgtttt 
taacttcacg ttttatatta taagagtttc attaaaaaga tcaataaaac gagaaaatgt 

gg 

(2) INFORMATION FOR SEQ ID NO : 2 8 : 
(i) SEQUENCE CHARACTERISTICS: 



60 
120 
180 
240 
300 
360 
420 



WO 00/40695 



PCTYUS00/00466 



612 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 59 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011833 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Met Val Met lie Val Ala Val Thr lie Glu Ala Gin Glu Glu Ser 
15 10 15 

Gly Trp Thr lie Cys Phe Arg Gin Cys Ser Gin Pro Cys Arg Ser Asp 
2 0 2 5 ^30 

15 Asp Gly Ser Cys Tyr Glu Asn Cys Lys lie Glu Cys Gly Gly Pro Lys 
35 40 45 

Pro Pro Leu Ser Arg Leu Arg Ser Ser His Ala 

50 55 
(2) INFORMATION FOR SEQ ID NO: 29: 
2 0 i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 58 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011834 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

Met Val Met He Val Ala Val Thr He Glu Ala Gin Glu Glu Ser Gly 
15 10 15 

Trp Thr He Cys Phe Arg Gin Cys Ser Gin Pro Cys Arg Ser Asp Asp 
20 25 30 

35 Gly Ser Cys Tyr Glu Asn Cys Lys He Glu Cys Gly Gly Pro Lys Pro 
35 40 45 

Pro Leu Ser Arg Leu Arg Ser Ser His Ala 

50 55 
(2) INFORMATION FOR SEQ ID NO: 30: 
40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
45 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 56 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011835 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met He Val Ala Val Thr He Glu Ala Gin Glu Glu Ser Gly Trp Thr 
15 10 15 

He Cys Phe Arg Gin Cys Ser Gin Pro Cys Arg Ser Asp Asp Gly Ser 
20 25 30 

55 Cys Tyr Glu Asn Cys Lys He Glu Cys Gly Gly Pro Lys Pro Pro Leu 
35 40 45 

Ser Arg Leu Arg Ser Ser His Ala 

50 " 55 
(2) INFORMATION FOR SEQ ID NO: 31: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 
<B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1..713 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011907 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



aattctggtt tttcttgcaa tgcttcttcc atttctctcc tcctctctcc tcctccgtct 60 

ctcttctccg atcatgtccc ctgatcccaa atccaatacc ttcttcttct ctttttccaa 120 

ttgattaccg gagaatttca gatcgcgatc tccgatttat taataattac agaaaaaaat 180 

aattaaaccc tagagagaga gagagatata tatatacatg gagagttcat taggtttcat 240 

ggcggttttc gccgtctcag gaagcgttgt gttcttagcg agtcaatttc acaagcgtct 300 

tctctccgat tacatggaca agttcgaatt cgaaatccga gcgcagaaaa aaatggtgat 360 

gaagaagaag gtgaqattcg cggcggatgt ggtggagccg tcggggaata acaaagagta 420 

tcgccggaga caitcttcca aggctaaatc gaattcgaag atggcggcaa ctatttgact 480 

ttaaggtttt ttgtacaaaa tttaatgtgt aatttattca tttgggtttt tgtgatttga 540 

aattcgtaaa tttaatttcg gaatctgaat ctgggctaaa cttttcaagt cttccccagg 600 

gtcaattctt tttcttcttc ttttttttct tgaggggttt ttttttgttg rtgtgacctg 660 



taaataaatg tgrtgtttgt gtatatacaa aatgtgattg ctaattacct ttt 
i2) INFORMATION FOR SEQ ID NO: 32: 
:i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 
( C } STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 8 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011908 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Met 


Glu 


Ser 


Ser 


Leu 


Gly 


Phe 


Met Ala 


Val 


Phe 


Ala 


Val 


Ser 


Gly Ser 


1 








5 








10 










15 


Val 


Val 


Phe 


Leu 


Ala 


Ser 


Gin 


Phe His 


Lys 


Arg 


Leu 


Leu 


Ser 


Asp Tyr 








20 








25 










30 




Met 


Asp 


Lys 


Phe 


Glu 


Phe 


Glu 


He Arg 


Ala 


Gin 


Lys 


Lys 


Met 


Val Met 






35 










40 








45 






Lys 


Lys 


Lys 


Val 


Arg 


Phe 


Ala 


Ala Asp 


Val 


Val 


Glu 


Pro 


Ser 


Gly Asn 




50 










55 








60 








Asn 


Lys 


Glu 


Tyr 


Arg 


Arg 


Arg 


His Ser 


Ser 


Lys 


Ala 


Lys 


Ser 


Asn Ser 


65 










70 








75 








80 


Lys 


Met 


Ala 


Ala 


Thr 
85 


He 


















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 33: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..7 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011909 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
Met Ala Val Phe Ala Val Ser Gly Ser Val Val Phe Leu Ala Ser Gin 
15 10 15 

Phe His Lys Arg Leu Leu Ser Asp Tyr Met Asp Lys Phe Glu Phe Glu 

20 25 30 

He Arg Ala Gin Lys Lys Met Val Met Lys Lys Lys Val Arg Phe Ala 
35 40 45 
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Ala Asp Val Val Glu Pro Ser Gly Asn Asn Lys Glu Tyr Arg Arg Arg 

50 55 60 

His Ser Ser Lys Ala Lys Ser Asn Ser Lys Met Ala Ala Thr lie 
65 70 75 

5 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

( B ) LOCATION: 1..5 9 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 1011910 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
Phe Trp Phe Phe Leu Gin Cys Phe Phe His Phe Ser Pro Pro Leu Ser 
15 10 15 

Ser Ser Val Ser Leu Leu Arg Ser Cys Pro Leu He Pro Asn Pro He 
20 20 ~ 25 30 

Pro Ser Ser Ser Leu Phe Pro He Asp Tyr Arg Arg lie Ser Asp Arg 

35 40 45 

Asp Leu Arg Phe He Asn Asn Tyr Arg Lys Lys 
50 55 
2 5 (2) INFORMATION FOR SEQ ID NO: 35: 

:i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 580 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 1011911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
aattgtttta aaattacaaa ttagtccgtt cttttattcc cgtactcgtt ccttcttctt 60 
cttcttcctc tcatcgtcat tttctcgatt ctcactcttc cggtcaccga ctaattctga 120. 
ataaggttta tcaaaagaat aagaataagt ggataaaaag ctagctttga aagagttatt 180 

40 gcagagaaaa aaaatgggat cgagagggat tatcaacgat aagtggtcaa tgaggattct 240 
atggggttgt gctatcggaa gtgctattgg tttatacatg gttgctgtag agagacaaac 300 
tcagaacagg gctcgtgcta tggctgagag tttgagagct gctgaatcac aaggtgatgg 360 
tgataatgtc taatatctac caagtagtgc tcagttgaat actctcagtt gagttttttt 420 
ttttggtgtt tgtttttgtt ataatgactt cttctgccaa gatggtgttg atgtagtttc 480 

4 5 ttttttgcaa ataatcgtaa taaggtttcg aaacttggag agttgaagtt gctgaacata 540 

cgatttgtgt tatcgcaaaa aaagttattt cttatgcctg 
(2) INFORMATION FOR SEQ ID NO:36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

5 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

55 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 59 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011912 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
Met Gly Ser Arg Gly lie He Asn Asp Lys Trp Ser Met Arg He Leu 
60 1 5 10 15 

Trp Gly Cys Ala He Gly Ser Ala He Gly Leu Tyr Met Val Ala Val 
20 25 30 
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Glu Arg Gin Thr Gin Asn Arg Ala Arg Ala Met Ala Glu Ser Leu Arg 

35 40 45 

Ala Ala Glu Ser Gin Gly Asp Gly Asp Asn Val 

50 55 
(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION : 



peptide 
1 . . 47 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011913 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Arg lie Leu Trp Gly Cys Ala lie Gly Ser Ala lie Gly Leu Tyr 
5 10 15 

Met Val Ala Val Glu Arg Gin Thr Gin Asn Arg Ala Arg Ala Met Ala 
20 25 20 

Ser Leu Arg Ala Ala Glu Ser Gin Gly Asp Gly Asp Asn Val 

35 40 45 

INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



Met 
1 



Glu 



(2) 



(ii) 
(ix) 



MOLECULE TYPE: 
FEATURE : 

(A) NAME/KEY: 

(B) LOCATION: 



(xi) 



peptide 
1 . . 40 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



ID 1011914 



lie Val Leu Lys Leu Gin lie Ser Pro Phe Phe Tyr Ser Arg Thr Arg 



1 

Ser 



10 



15 



Phe Phe Phe Phe Phe Leu Ser Ser Ser Phe Ser Arg Phe Ser Leu 



20 



25 



30 



Phe Ara Ser Pro Thr Asn Ser Glu 



35 



40 



(2) 



INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



DNA (genomic) 



1 . .415 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011954 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
tcacaaaacc tgattgttct gtttcgaacc gatagattcg aaccttggtt taagttgccg 
gcttcgagga atctccggtg tggactttca ctcctgtttc ttcttcatga cgcaagctgt 
tttctctaga atacgctttc ttagaaccct ggaaaacgtt acccagcttc ctcaatcctc 
tagctctagc tcgaagaaca tcctttgaaa cacttggatc tttaagcacc gcttggcaga 
cacgagataa ggttgattcg atgtcaacaa catttatttg ccaaagggat tgaagcattg 
catccttttt agcttcaatg gctttctgaa tatgtctctg atctttgtga ttgtgatgat 
catgcatgtc ctatcgttgc gatgtatttt ataaatgttc gtctgtaagt tattt 
(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 



60 
120 
180 
240 
300 
360 
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(B) TYPE: amino a^cid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. . 42 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011955 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 0 : 
Ser Gin Asn Leu lie Val Leu Phe Arg Thr Asp Arg Phe Glu Pro Trp 
15 10 15 

Phe Lys Leu Pro Ala Ser Arg Asn Leu Arg Cys Gly Leu Ser Leu Leu 

20 25 ~ 30 

Phe Leu Leu His Asp Ala Ser Cys Phe Leu 

35 40 
(2) INFORMATION FOR SEQ ID NO : 4 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
'O STRANDEDNESS : 

; D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 33 

( D) OTHER INFORMATION: / Ceres Seq. ID 1011956 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 
Met Thr Gin Ala Val Phe Ser Arg lie Arg Phe Leu Arg Thr Leu Glu 
15 10 15 

Asn Val Thr Gin Leu Pro Gin Ser Ser Ser Ser Ser Ser Lys Asn lie 
20 25 30 

Leu 

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 40 base pairs 

(B) TYPE: nucleic acid 
■(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/ KEY: - 

(B) LOCATION: 1..4 40 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011960 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



gacagtcagt cactgtaaca ttttagatct ttcccgaaga agaaaacgaa gaagagacga 60 

agagagaaat gaggccgatg cagctggata tgttatcgga gatggatgat gcaggttctt 120 

cgatggccat ggacgttgat gacctcgaag ccatggagat actcaacgaa ggaggacttg 180 

tctcagataa caagctcgcc gacgccgatt tcttcaacaa attcgatgat gatttcgatg 240 

acaccgatat caactaaatc cggttaagtt ttctctttaa gatttcagtt ttttttccta 300 

aacttgatgt aatttgtgga gaaacctatg tagtaagtgt gtaataagtt ctgggatttt 360 

tatgtttgtg gtatgtaaca atgtttttct tcatttggat tactaaaaca tttgatttgt 420 



gtgttattgt gctttgtctg 
(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
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(B) LOCATION: 1 . . 62 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011961 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 3 : 
Met Arg Pro Met Gin Leu Asp Met Leu Ser Glu Met Asp Asp Ala Gly 
15 10 15 

Ser Ser Met Ala Met Asp Val Asp Asp Leu Glu Ala Met Glu lie Leu 

20 25 30 

Asn Glu Gly Gly Leu Val Ser Asp Asn Lys Leu Ala Asp Ala Asp Phe 

35^ 40 45 

Phe Asn Lys Phe Asp Asp Asp Phe Asp Asp Thr Asp lie Asn 

50 55 60 

■;2) INFORMATION FOR SEQ ID MO : 4 4 : 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
■ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 5 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011962 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 : 
Met Gin Leu Asp Met Leu Ser Glu Met Asp Asp Ala Gly Ser Ser Met 
1 5 10" 15 

Ala Met Asp Val Asp Asp Leu Glu Ala Met Glu lie Leu Asn Glu Gly 

20 25 30 

Gly Leu Val Ser Asp Asn Lys Leu Ala Asp Ala Asp Phe Phe Asn Lys 

35 40 45 

Phe Asp Asp Asp Phe Asp Asp Thr Asp lie Asn 

50 55 
(2) INFORMATION FOR SEQ ID NO : 4 5 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. .55' 

(D) OTHER INFORMATION: / Ceres Seq. ID 1011963 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 : 
Met Leu Ser Glu Met Asp Asp Ala Gly Ser Ser Met Ala Met Asp Val 
15 10 15 

Asp Asp Leu Glu Ala Met Glu lie Leu Asn Glu Gly Gly Leu Val Ser 

20 25 ~ 30 

Asp Asn Lys Leu Ala Asp Ala Asp Phe Phe Asn Lys Phe Asp Asp Asp 

35 40 45 

Phe Asp Asp Thr Asp lie Asn 

50 55 
(2) INFORMATION FOR SEQ ID NO : 4 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .193 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014075 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 : 
aaaaccctta agctaagcaa atttcaaagc aattcaaaaa ccctagcctc tctctcttta 60 
tctctcacaa caaaatcttt gaaatggcga tctctaaggc ttccattgtt gttctcatga 120 
tggtgattat ctccgtcgtt gcatcggcac agtctgaggc accagcacca agtcctactt 180 
5 ctggatctag teg 

(2) INFORMATION FOR SEQ ID NO : 4 7 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
15 (B) LOCATION: 1 . . 3 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014076 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 : 
Lys Thr Leu Lys Leu Ser Lys Phe Gin Ser Asn Ser Lys Thr Leu Ala 
l" 5 10 .15 

20 Ser Leu Ser Leu Ser Leu Thr Thr Lys Ser Leu Lys Trp Arc Ser Leu 

2 0 2 5 3 0 

Arg Leu Pro Leu Leu Phe Ser 
35 

(2; INFORMATION FOR SEQ ID NO : 4 8 : 
2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
30 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 36 

(D) OTHER INFORMATION: / Ceres Seq. ID 1014077 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 : 

Met Ala lie Ser Lys Ala Ser lie Val Val Leu Met Met Val He He 
15 10 15 

Ser Val Val Ala Ser Ala Gin Ser Glu Ala Pro Ala Pro Ser Pro Thr 
20 25 30 

40 Ser Gly Ser Ser 
35 

(2) INFORMATION FOR SEQ ID NO : 4 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
50 (A) NAME/KEY: - 

(B) LOCATION: 1 . .711 

(D) OTHER INFORMATION: / Ceres Seq. ID 1015865 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
actcttccgg tcaccgacta attctggtta gctcacattc ettatgetea ttcatccaca 60 

55 gttccctaat taacgctacg aattttgttt ctcctcttat ttcttctttc ctcttgtaga 120 
ataaggttta tcaaaagaat aagaataagg tttatctctt aaatctccga taattagcag 180 
agttttttca ggattgagtt tgatttgctg ttttggaatc ataaattttg cgttttagtg 240 
gataaaaagc tagcttaaag agttattgca gagaaaaaaa atgggatcga gagggattat 300 
caacgataag tggtcaatga ggattctatg gggttgtgct ateggaagtg ctattggttt 360 

60 atacatggtt gctgtagaga gacaaactca gaacagggct cgtgctatgg ctgagagttt 420 
gagagctget gaatcacaag gtgatggtga taatgtctaa tatctaccaa gtagtgctca 480 
gttgaatact ctcagttgag tttttttttt tggtgtttgt ttttgttata atgacttctt 540 
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ctgccaagat ggtgttgatg tagtttcttt 
cttggagagt tgaagttgct gaacatacga 
atgcctgtca tgctatgttt gtgaattcga 
(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 



619 
tttgcaaata 
tttgtgttat 
tttttaatgc 



at cgtaataa 
cgcaaaaaaa 
gtattttcag 



ggtttcgaaa 
gttatttctt 
c 



600 
660 



Met 
1 

Trp 

Glu 



;ix) 



(xi) 
Gly 



(D) TOPOLOGY 
MOLECULE TYPE 
FEATURE: 

(A) NAME /KEY 

(B) LOCATION 



linear 
peptide 



peptide 
1 . . 59 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



ID 1015866 



Ser Arg 



Gly Cys 



.Ala A 



Gin 
35 
Glu 



Ala 

20 

Thr 



Gly 
He 



He He Asn Asp 



Gly Ser Ala 



Gin Asn Aro 



Ala 
4 0 
Gly 



He 

25 
Arg 



Lys 

10 

Gly 



Trp Ser Met Arg lie Leu 
15 
Ala 



Leu Tyr Met 



Ala Met 



Glu 

45 



Val 
30 
Ser 



Val 



Leu Arg 



Asp Asn 



(2) 



Ser Gin Gly Asp 
50 55 
INFORMATION FOR SEQ ID NO: 51: 
fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 47 amino acids 
<B> TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE : 

(A) NAME/KEY: 

(B) LOCATION: 



(ii) 
(ix) 



Met 
1 

Met 



(xi) 
Arg 



peptide 
1 . . 47 

(D) OTHER INFORMATION: / Ceres Seq. ID 1015867 
SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
He Leu Trp Gly Cys Ala He Gly Ser Ala He Gly 



Tyr 



- 7 al Ala 



Val 
20 
Arg 



5 

Glu 



Arg Gin Thr 



Glu Ser Leu 
35 

(2) INFORMATION 

(i 



Ala Ala Glu Ser 
40 

FOR SEQ ID NO: 52 : 
SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: 



10 

Gin Asn Arg Ala Arg Ala 
25 30 
Gin Gly Asp Gly Asp Asn 
45 



Leu 
15 

Met Ala 
Val 



(D) TOPOLOGY 
(ii) MOLECULE TYPE 
(ix) FEATURE: 

(A) NAME/KEY 

(B) LOCATION 



linear 
peptide 



peptide 
1. .38 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



ID 1015868 



Ser 
1 

Phe 



(xi) 
Ser Gly His 



He His 



Ser 
20 
Phe 



Arg 
5 

Ser 



Leu He Leu Val 



Leu He Asn 



Ala 

25 



Ser 

10 

Thr 



Ser His Ser Leu 



Asn Phe Val 



Ser 
30 



Cys Ser 
15 

Pro Leu 



Leu Leu 



He Ser Ser 

35 

(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 
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(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 432 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021371 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

10 gtcattttct cgattctcac tcttccggtc accgactaat tctgagaaaa aaaatgggat 60 
cgagagggat tatcaacgat aagtggtcaa tgaggattct arggggttgt gctatcggaa 120 
gtgctattgg tttatacatg gttgctgtag agagacaaac tcagaacagg gctcgtgcta 180 
tggctgagag tttgagagct gctgaatcac aaggtgatgg tgataatgtc taatatctac 240 
caagtagtgc tcagttgaat actctcagtt gagttttttt ttttggtgtt tgtt tttgtt 300 

15 ataatgactt cttctgccaa gatggtgttg atgtagtttc ttttttgcaa ataatcgtaa 360 
taaggtttcg aaacttggag agttgaagtt gctgaacata cgatttgtgt tatcqcaaaa 420 
aaagttattt cc 

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 7 6 amino acids 

v B } TYPE: amino acid 
■ C) STRANDEDNESS : 
{ D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
2 5 (ix) FEATURE: 

(A) NAME /KEY : peptide 
{ B ) LOCATION: 1 . . 7 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021372 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
30 His Phe Leu Asp Ser His Ser Ser Gly His Arg Leu lie Leu Arg Lys 
1 * 5 10 15 

Lys Met Gly Ser Arg Gly lie lie Asn Asp Lys Trp Ser Met Arg lie 

20 25 ~ " ^ 30 

Leu Trp Gly Cys Ala lie Gly Ser Ala lie Gly Leu Tyr Met Val Ala 
35 35 4 0 4 5 

Val Glu Arg Gin Thr Gin Asn Arg Ala Arg Ala Met Ala Glu Ser Leu 

50 55 60 

Arg Ala Ala Glu Ser Gin Gly Asp Gly Asp Asn Val 
65 70 75 

4 0 (2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

4 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 59 

50 (D) OTHER INFORMATION: / Ceres Seq. ID 1021373 



55 



60 





(xi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:55: 












Met 


Gly 


Ser Arg 


Gly lie 


lie Asn Asp 


Lys 


Trp 


Ser 


Met 


Arg 


He 


Leu 


1 






5 




10 










15 




Trp 


Gly 


Cys Ala 


lie Gly 


Ser Ala lie 


Gly 


Leu 


Tyr 


Met 


Val 


Ala 


Val 






20 




25 










30 






Glu 


Arg 


Gin Thr 


Gin Asn 


Arg Ala Arg 


Ala 


Met 


Ala 


Glu 


Ser 


Leu 


Arg 






35 




40 








45 








Ala 


Ala 


Glu Ser 


Gin Gly 


Asp Gly Asp 


Asn 


Val 














50 






55 
















(2) 


INFORMATION 


FOR SEQ 


ID NO:56: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 4 7 

(D) OTHER INFORMATION: / Ceres Seq. ID 1021374 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
10 Met Arg lie Leu Trp Gly Cys Ala lie Gly Ser Ala He Gly Leu Tyr 
15 10 15 

Met Val Ala Val Glu Arg Gin Thr Gin Asn Arg Ala Arg Ala Met Ala 

20 25 30 

Glu Ser Leu Arg Ala Ala Glu Ser Gin Glv Asp Gly Asp Asn Val 
15 35 4 0 4 5 

(2) INFORMATION FOR SEQ ID NO: 57: 
■;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

2 0 ;C) STRANDEDNESS: single 

i D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic; 
(ix) FEATURE: 

(A) NAME /KEY: - 

2 5 (B) LOCATION: 1..28 6 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022578 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
caccattgga tggttcggct atacaagctg aaacctccca ggaacaggat tcgtggtaaa 60 
gcaaagaaac tgaaatcgaa aacaagctct ggattgagtt caaaatcagc gaagaagaac 120 
30 ccgtgggtct agaaatccta gaattttcaa tgttggatgg tgagttatat gtcaactctt 180 
aggtctcttt attatcgaga aagttgttgc agacgcagtc aagtttctgg tgtaggattc 240 
gtaaaaattt tgttaacttt attcgaattt catgtttatt gcggag 
(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

£D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
4 0 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: ./ Ceres Seq. ID 1022579 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

4 5 His His Trp Met Val Arg Leu Tyr Lys Leu Lys Pro Pro Arg Asn Arg 
1 5 10 15 

lie Arg Gly Lys Ala Lys Lys Leu Lys Ser Lys Thr Ser Ser Gly Leu 

20 25 30 

Ser Ser Lys Ser Ala Lys Lys Asn Pro Trp Val 
50 35 40 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
55 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
60 (B) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: / Ceres Seq. ID 1022580 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
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Met Val Ser Tyr Met Ser Thr Leu Arg Ser Leu Tyr Tyr Arg Glu Ser 
1 5 10 15 

Cys Cys Arg Arg Ser Gin Val Ser Gly Val Gly Phe Val Lys lie Leu 
2cT 25 30 

5 Leu Thr Leu Phe Glu Phe His Val Tyr Cys Gly 
35 4 0 

(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
■:ii) MOLECULE TYPE: peptide. 
;ix) FEATURE: 

15 (A) NAME /KEY : peptide 

(B) LOCATION: 1..4 0 

{ D) OTHER INFORMATION: / Ceres Seq. ID 1022581 
,xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
Met Val Arg Leu Tyr Lys Leu Lys Pro Pro Arg Asn Ara lie Arg Gly 
20 1 5 10 15 

Lys Ala Lys Lys Leu Lys Ser Lys Thr Ser Ser Gly Leu Ser Ser Lys 

2 0 2 5 30 

Ser Ala Lys Lys Asn Pro Trp Val 
35 40 
25 (2) INFORMATION FOR SEQ ID NO : 6 1 : 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 643 

35 (D) OTHER INFORMATION: / Ceres Seq. ID 1024240 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
agactcagtg aggattttct aggcgatttt ctcgagaaaa ccgtttgagg aaaatggaga 60 
cttcratgag gtataccagc aattccaagt cuatgaagat tcatgccaaa gagaaggttc 120 
cggtgaactc aaaaacccat ttacagcttc atggagagtt agataccgga actggggctc 180 

40 cgagttactt ctgtgcgatg attagacact tttttcctga ggccttgggg taggattgca 240 
ttatgataag cgccaaaagc ttcggtgtct tgtacgcgga aaaaaagagt ttcctgtaag 300 
agctgataag cgtgtaacct ttaatattaa agggcggtgt gatattgatc aggacttaaa 360 
tcagaagaac cccaaaggag cagcagaatt tgcctggaac ataatggatt tcaaggaaga 420 
tcaggatgta cggatcaaag ttggctacga aatgtttgat aaggtccctt atatgcagat 480 

45 tagagaaaac aattggactc tcaacgcgaa catgaaggga aaatggaact tgcggtatga 540 
cctgtaactg catttttttc aatcatcatc tgagaaatgt attgatacca ctgctgatga 600 
acacatttta attctaccaa ttaatcaaat tcagagatct tec 
{2} INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

55 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 5 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024241 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
60 Met Giu Thr Ser Met Arg Tyr Thr Ser Asn Ser Lys Ser Met Lys lie 
15 10 15 

His Ala Lys Glu Lys Val Pro Val Asn Ser Lys Thr His Leu Gin Leu 
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20 25 30 

His Gly Glu Leu Asp Thr Gly Thr Gly Ala Pro Ser Tyr Phe Cys Ala 

35 40 45 

Met He Arg His Phe Phe Pro Glu Ala Leu Gly 
5 50 55 

(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
15 (B) LOCATION: 1..55 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024242 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
Met Arg Tyr Thr Ser Asn Ser Lys Ser Met Lys He His Ala Lys Glu 
1 5 10 . 15 

2 0 Lys Val Pro Val Asn Ser Lys Thr His Leu Gin Leu His Gly Glu Leu 

20 25 30 

Asp Thr Gly Thr Gly Ala Pro Ser Tyr Phe Cys Ala Met He Arg His 

35 40 " 45 

Phe Phe Pro Glu Ala Leu Gly 
25 50 55~ 

(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

( B) TYPE: amino acid 
30 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
35 <B) LOCATION: 1 . . 4 7 

(D) OTHER INFORMATION: / Ceres Seq. ID 1024243 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
Met Asp Phe Lys Glu Asp Gin Asp Val Arg lie Lys Val Gly Tyr Glu 
15 10 15 

4 0 Met Phe Asp Lys Val Pro Tyr Met Gin lie Arg Glu Asn Asn Trp Thr 

20 25 30 

Leu Asn Ala Asn Met Lys Gly Lys Trp Asn Leu Arg Tyr Asp Leu 

35 4 0 4 5 

(2) INFORMATION FOR SEQ ID NO: 65: 
4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 729 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026562 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

gttaatcata taaatttgac cctaaaatct atctcattct ctcatcaatc ttcttccttt 60 
aaatccctaa atcttcacgt tacttttcta tctacatttt cactttcaat ggctttgatt 120 
atcacctgct ccgctctacc cacgatccga gcatcttccg gatccggatc ccttaacccg 180 
gatcaaaacc gtaagaaatc tgctgcttgg tgggctcctc tcttcggttt accctccgat 240 

60 ccagattacc tcaacatcga aagctcatgc tccaccgtga atccggataa aaccgatatt 300 
tccgggtcgg gtcaaaagtt tcgtcgcggt tgttttacgg aggagaaagc taagcagttg 360 
aggaggaaaa ccgcagaagc ttccacgttc catgacgtaa tgtatcactc cgccattgct 420 
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tctcgacttg cgtctgatat caccggccga gtcgaggatt gaaacggatt cgggtcagat 480 
ctggagacga atcaattaat gaataatgaa ttaatctttt ttttcttaat tcagtgttct 540 
tgtaggatgc agaccatctt ctatcggtgc tcctttttca tttgaaccgt tggcttgtgt 600 
ttggtctttg tgtgttgtaa actctggttc cctttctgtt tcttctgtaa tcaaccgttg 660 
5 gatttccact tgttattttt agacatgtgt tccctagtta gctcttttat cttatctcaa 720 
atttaaggc 

(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii; MOLECULE TYPE: peptide 
(ix) FEATURE: 

15 (A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 153 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026563 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
Val Asn His lie Asn Leu Thr Leu Lvs Ser lie Ser Phe Ser His Gin 
20 1 5 10 15 

Ser Ser Ser Phe Lys Ser Leu Asn Leu His Val Thr Fhe Leu Ser Thr 

20 25 30 

Phe Ser Leu Ser Met Ala Leu lie He Thr Cys Ser Ala Leu Pro Thr 
35 40 45 

2 5 He Arg Ala Ser Ser Gly Ser Gly Ser Leu Asn Pro Asp Gin Asn Arg 
50 " 55 60 

Lys Lys Ser Ala Ala Trp Trp Ala Pro Leu Phe Gly Leu Pre Ser A.sp 
65 70 75 80 

Pro Asp Tyr Leu Asn lie Glu Ser Ser Cys Ser Thr Val Asn Pro Asp 
30 85 90 95 

Lys Thr Asp lie Ser Gly Ser Gly Gin Lys Phe Arg Arg Gly Cys Phe 

100 105 110 

Thr Glu Glu Lys Ala Lys Gin Leu Prg Arg Lys Thr Ala Glu Ala Ser 
115 ' ' 120 ' 125 

35 Thr Phe His Asp Val Met Tyr His Ser Ala He Ala Ser A.rg Leu Ala 
130 135 140 

Ser Asp He Thr Gly Arg Val Glu Asp 
145 150 
(2) INFORMATION FOR SEQ ID NO: 67: 
4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 
{ B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

4 5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .117 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026564 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 67 : 

Met Ala Leu He He Thr Cys Ser Ala Leu Pro Thr He Arg Ala Ser 
1 5 10 15 

Ser Gly Ser Gly Ser Leu Asn Pro A.sp Gin Asn Arg Lys Lys Ser Ala 
20~ 25 30 

5 5 Ala Trp Trp Ala Pro Leu Phe Gly Leu Pro Ser Asp Pro Asp Tyr Leu 

35 40 45 

Asn He Glu Ser Ser Cys Ser Thr Val Asn Pro Asp Lys Thr Asp He 

50 55 60 

Ser Gly Ser Gly Gin Lys Phe Arg Arg Gly Cvs Phe Thr Glu Glu Lys 
60 65 ^ 70 " " 75 80 

Ala Lys Gin Leu Arg Arg Lys Thr Ala Glu Ala Ser Thr Phe His Asp 
85 90 95 



WO 00/40695 



PCT/US00/00466 



625 

Val Met Tyr His Ser Ala lie Ala Ser Arg Leu Ala Ser Asp lie Thr 

100 105 110 

Gly Arg Val Glu Asp 
115 

(2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
fix) FEATURE: 

(A) NAME/KEY: - 

£ B ) LOCATION: I . .484 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026648 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
aaggatcaca aaattagggt tttaatttga tagagaagat gaanttcaga agctttgagg 
agttctggcc tttctacatg atgcaacact cgaatccatc gacgcgaagg ttgcacttca 
taggtatcat cgcgagcatc gttgctttga tatgttcqat tttgatcaac tggtggttct 
tagctctggt gcctctgctt. gggtacggat tcgcgtggta tagccacttc ttcgtggaag 
TgaargLtcc ggcgagcttt gggcatccgc tttggrcgtt tctctgcgat ctcaagatgt 
tragrcrgat gctcacagga agcatggaga gagagatgaa gagacttggt aagaggccat 
tgttgcagct ctcttgaagc agcagcagat atctctatag aattgttctt gattcttctt 
attgcgttct qcgattggat tttagactcc agtttgtaat tacttcatgg aatcgttgtt 
tgcg 

(2) INFORMATION FOR SEQ ID NO: 69: 
{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 112 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026649 
:xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



Met 


Asn 


Phe 


Arg 


Ser 


Phe 


Glu 


Glu 


Phe 


Trp 


Pro 


Phe 


Tyr 


Met 


Met 


Gin 


1 








5 










10 










15 




His 


Ser 


Asn 


Pro 


Ser 


Thr 


Arg 


Arg 


Leu 


His 


Phe 


He 


Gly 


He 


He 


Ala 








20 










25 










30 






Ser 


He 


Val 


Ala 


Leu 


He 


Cys 


Ser 


He 


Leu 


He 


Asn 


Trp 


Trp 


Phe 


Leu 






35 










40 










45 








Ala 


Leu 


Val 


Pro 


Leu 


Leu 


Gly 


Tyr 


Gly 


Phe 


Ala 


Trp 


Tyr 


Ser 


His 


Phe 




50 










55 










60 










Phe 


Val 


Glu 


Gly 


Asn 


Val 


Pro 


Ala 


Ser 


Phe 


Gly 


His 


Pro 


Leu 


Trp 


Ser 


65 










70 










75 










80 


Phe 


Leu 


Cys 


Asp 


Leu 


Lys 


Met 


Phe 


Ser 


Leu 


Met 


Leu 


Thr 


Gly 


Ser 


Met 










85 










90 










95 




Glu 


Arg 


Glu 


Met 


Lys 


Arg 


Leu 


Gly 


Lys 


Arg 


Pro 


Leu 


Leu 


Gin 


Leu 


Ser 








100 










105 










110 







(2) INFORMATION FOR SEQ ID NO : 7 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 99 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026650 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
Met Met Gin His Ser Asn Pro Ser Thr Arg Arg Leu His Phe lie Gly 
15 10 15 

lie lie Ala Ser He Val Ala Leu He Cys Ser He Leu He Asn Trp 
5 20 25 30 

Trp Phe Leu Ala Leu Val Pro Leu Leu Gly Tyr Gly Phe Ala Trp Tyr 

35 4 0 4 5 

Ser His Phe Phe Val Glu Gly Asn Val Pro Ala Ser Phe Gly His Pro 
50 55 60 

10 Leu Trp Ser Phe Leu Cys Asp Leu Lys Met Phe Ser Leu Met Leu Thr 
65 70 75 80 

Gly Ser Met Glu Arg Glu Met Lys Arg Leu Gly Lys arg Pro Leu Leu 
85 90 95 

Gin Leu Ser 

15 

(2) INFORMATION FOR SEQ ID NO: 71: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
2 0 (C) STRANDEDNESS : 

{ D ) TOPOLOGY: linear 
Hi) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

2 5 (B) LOCATION: 1. . 98 

(D) OTHER INFORMATION: / Ceres Seq. ID 1026651 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 : 
Met Gin His Ser Asn Pro Ser Thr Arg Arg Leu His Phe He Gly He 
15 10 15 

30 He Ala Ser He Val Ala Leu He Cys Ser He Leu He Asn Trp Trp 

20 25 30 

Phe Leu Ala Leu Val Pro Leu Leu Gly Tyr Gly Phe Ala Trp Tyr Ser 

35 4 0 4 5 

His Phe Phe Val Glu Gry Asn Val Pro Ala Ser Phe Gly His Pro Leu 

3 5 50 ' 55 60 

Trp Ser Phe Leu Cys Asp Leu Lys Met Phe Ser Leu Met Leu Thr Gly 
65 70 ' 75 80 

Ser Met Glu Arg Glu Met Lys Arg Leu Gly Lys Arg Pro Leu Leu Gin 
85 " 90 " 95 

40 Leu Ser 

(2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(ix) FEATURE: 
50 (A) NAME/KEY: - 

(B) LOCATION: 1 . .563 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027881 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

aaaacacaga gagtgacgag agagagagag tgaaaatgga gaagtacttt ggaaatgcgt 60 

55 acaggggaga tccaggagtg ccacatgcag atgcagatcg tttcatgaat atatggattg 120 

gttctgctgc tttctccgtt cttacctggg ttaatcctta catgtggcag ctctctaacc 180 

agttcaatta ccatgacaag tggatgctgt ttgagcagta ccactggaaa aaagcaaggg 240 

caaagaagca accttatgaa ttcaagtgga ataagatacc caaaqaagtc agggactcgt 300 

actattacaa ctggccagtc tacttcccat agaagtgtct ctgtgttgct gtgaatcaga 360 

60 agagaaacca aacacttgtg gaacttctat ggaatctatc tctatctctt cttgatctgt 420 

tattcgtagc acttttggat tttaagtttt ttcttttgta atgatcaatc tctcaactct 480 

tcactgtttc atctcctcta gttgactctc ttatggaaat attgaatctc tttgaagagc 540 
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agaaccagaa tctgtctgtt atg 
(2) INFORMATION FOR SEQ ID NO:73: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 109 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027882 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 



Asn 


Thr 


Glu 


Ser 


Asp 


Glu 


Arg 


Glu 


Arg 


Val 


Lys 


Met 


Glu 


Lys 


Tyr 


Phe 


1 








5 










10 










15 




Gly 


Asn 


Ala 


Tyr 

20 


Arg 


Gly 


Asp 


Pro 


Gly 
25 


Val 


Pro 


His 


Ala 


Asp 
30 


Ala 


Asp 


Arg 


Phe 


Met 

35 


Asn 


He 


Trp 


He 


Gly 
40 


Ser 


Ala 


Ala 


Phe 


Ser 
4 5 


Val 


Leu 


Thr 


Trp 


Val 
50 


Asn 


Pro 


Tyr 


Met 


Trp 
55 


Gin 


Leu 


Ser 


Asn 


Gin 

60 


Phe 


Asn 


Tyr 


His 


Asp 


Lys 


Trp 


Met 


Leu 


Phe 


Glu 


Gin 


Tyr 


His 


Trp 


Lys 


Lys 


Ala 


Arg 


Ala 


65 










70 










75 










80 


Lys 


Lys 


Gin 


Pro 


Tyr 

85 


Glu 


Phe 


Lys 


Trp 


Asn 
90 


Lys 


He 


Pro 


Lys 


Glu 
95 


Val 


Arg 


Asp 


Ser 


Tyr 
100 


Tyr 


Tyr 


Asn 


Trp 


Pro 
105 


Val 


Tyr 


Phe 


Pro 









(2) INFORMATION FOR SEQ ID NO : 7 4 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 
{ B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
(3) LOCATION: I . . 98 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027883 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 4 : 



Met 


Glu 


Lys 


Tyr 


Phe 


Gly 


Asn 


Ala 


Tyr 


Arg 


Gly 


Asp 


Pro 


Gly 


Val 


Pro 


1 








5 










10 










15 




His 


Ala 


Asp 


Ala 


Asp 


Arg 


Phe 


Met 


Asn 


He 


Trp 


He 


Gly 


Ser 


Ala 


Ala 








20 










25 










30 






Phe 


Ser 


Val 


Leu 


Thr 


Trp 


Val 


Asn 


Pro 


Tyr 


Met 


Trp 


Gin 


Leu 


Ser 


Asn 






35 










40 










45 








Gin 


Phe 


Asn 


Tyr 


His 


Asp 


Lys 


Trp 


Met 


Leu 


Phe 


Glu 


Gin 


Tyr 


His 


Trp 




50 










55 










60 










Lys 


Lys 


Ala 


Arg 


Ala 


Lys 


Lys 


Gin 


Pro 


Tyr 


Glu 


Phe 


Lys 


Trp 


Asn 


Lys 


65 










70 










75 










80 


He 


Pro 


Lys 


Glu 


Val 


Arg 


Asp 


Ser 


Tyr 


Tyr 


Tyr 


Asn 


Trp 


Pro 


Val 


Tyr 










85 










90 










95 




Phe 


Pro 






























(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO : 7 5 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
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(B) LOCATION: 1 . . 75 

(D) OTHER INFORMATION: / Ceres Seq. ID 1027884 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 5 : 



Met 


Asn 


He 


Trp 


He 


Gly 


Ser 


Ala 


Ala 


Phe 


Ser 


Val 


Leu 


Thr 


Trp 


Val 


1 








5 










10 










15 




Asn 


Pro 


Tyr 


Met 


Trp 


Gin 


Leu 


Ser 


Asn 


Gin 


Phe 


Asn 


Tyr 


His 


Asp 


Lys 








20 










25 










30 






Trp 


Met 


Leu 


Phe 


Glu 


Gin 


Tyr 


His 


Trp 


Lys 


Lys 


Ala 


Arg 


Ala 


Lys 


Lys 






35 










40 










45 








Gin 


Pro 


Tyr 


Glu 


Phe 


Lys 


Trp 


Asn 


Lys 


He 


Pro 


Lys 


Glu 


Val 


Arg 


Asp 




50 










55 










60 










Ser 


Tyr 


Tyr 


Tyr 


Asn 


Trp 


Pro 


Val 


Tyr 


Phe 


Pro 













65 70 75 

12) INFORMATION FOR SEQ ID NO: 76: 
■;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: DNA (genomic 
;ix) FEATURE: 

(A) NAME /KEY: - 

( B } LOCATION: 1 . . 305 

(D) OTHER INFORMATION: / Ceres Seq. ID 1381797 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 6 : 
gaggttttgg ttttgttgac tcagtgagga ttttctaggc gattttctcg agaaaarcgr 
ttgaggaaaa tggagacttc targaggtat accagcaatt ccaagtctat gaagatrcat 
gccaaagaga aggttccggt gaactcaaaa acccatttac agcttcatgg agagttagat 
actggaactg gggctccgag ttacttctgt gcgatgatta gacacttttt tcctgaggct 
tcaacaggcc ttggggtagg attgcattat gataagcgcc aaaagcttcg gtgtcttgta 
cgcgg 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 78 

(D) OTHER INFORMATION: / Ceres Seq. ID 1381798 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



Met 


Glu 


Thr 


Ser 


Met 


Arg 


Tyr 


Thr 


Ser 


Asn 


Ser 


Lys 


Ser 


Met 


Lys 


He 


1 








5 










10 










15 




His 


Ala 


Lys 


Glu 


Lys 


Val 


Pro 


Val 


Asn 


Ser 


Lys 


Thr 


His 


Leu 


Gin 


Leu 








20 










25 










30 






His 


Gly 


Glu 


Leu 


Asp 


Thr 


Gly 


Thr 


Gly 


Ala 


Pro 


Ser 


Tyr 


Phe 


Cys 


Ala 






35 










40 










45 








Met 


He 


Arg 


His 


Phe 


Phe 


Pro 


Glu 


Ala 


Ser 


Thr 


Gly 


Leu 


Gly 


Val 


Gly 




50 










55 










60 










Leu 


His 


Tyr 


Asp 


Lys 


Arg 


Gin 


Lys 


Leu 


Arg 


Cys 


Leu 


Val 


Arg 







65 70 75 

(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : peptide 
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(B) LOCATION: 1..7 4 

(D) OTHER INFORMATION: / Ceres Seq. ID 1381799 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
Met Arg Tyr Thr Ser Asn Ser Lys Ser Met Lys lie His Ala Lys Glu 
5 1 5 10 15 

Lys Val Pro Val Asn Ser Lys Thr His Leu Gin Leu His Gly Glu Leu 

20 25 30 

Asp Thr Gly Thr Gly Ala Pro Ser Tyr Phe Cys Ala Met lie Arg His 
35 40 45 

10 Phe Phe Pro Glu Ala Ser Thr Gly Leu Gly Val Gly Leu His Tyr Asp 
50 55 60 

Lys Arg Gin Lys Leu Arg Cys Leu Val Arg 
65 70 
(2) INFORMATION FOR SEQ ID NO : 7 9 : 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY: linear 
2 0 Hi) MOLECULE TYPE: peptide 

;ix) FEATURE: 

(A) NAME/ KEY : peptide 
(5) LOCATION: I . . 65 



v 



OTHER INFORMATION: / Ceres Seq. ID 1381800 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 9 : 

Met Lys lie His Ala Lys Glu Lys Val Pro Val Asn Ser Lys Thr His 
1 " 5 10 15 

Leu Gin Leu His Gly Glu Leu Asp Thr Gly Thr Gly Ala Pro Ser Tyr 
20 25 30 

30 Phe Cys Ala Met lie Arg His Phe Phe Pro Glu Ala Ser Thr Gly Leu 
35 40 45 

Gly Val Gly Leu His Tyr Asp Lys Arg Gin Lys Leu Arg Cys Leu Val 
50 55 60 

Arg 
35 65 

(2) INFORMATION FOR SEQ ID NO : 8 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 
(5) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
45 (B) LOCATION: 1..840 

(D) OTHER INFORMATION: / Ceres Seq. ID 1442747 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
acacgaaacg attccgcttc tccgcttcct tcataaatat ctcactcatc gtcactacat 60 
tgttctttca cacgcaattt tcatttctct cttaaccgta aacgaatctc tcttctcaaa 120 
50 gttttatttt ctctctgcga tggctcaaga agatgttact gctgttgcta ctaacggtgc 180 
tggtccggtg gagacgcatc ttgtcttcac tgagttcaag cagatgttgc tcgttgaagc 240 
tcaaaaggtc ggtgacgctg ttactttcta caaatctgct tttggtgcga tcgagtctgg 300 
tcattctctt taccctaagc gtaacttgac caagagcttc ctcatgttct ctcttctgag 360 
cttaatctcg ctggctcttc cttcgttgtt tgcgacgttt cctctctccc tggtttttct 420 
55 actgcgaaat cggaaggttc gggagtgact tttcttctcg gaactaagga tgctgaagcc 480 
gccgttgcga aagctgttga cgccggagct gtgaaagtgg aggttacgga ggcagaagtt 54 0 

gaactgggat tcaaaggaaa agttacggat ccttttggtg tcacttggat cttcgcggag 600 
aagaagaccg tgatcaccga cgagaacaaa gaggtttaga atctgtcgtc ggatctaatt 660 
cttcgtcggt ttctgaacaa aaaaaaatta tctattatcc taggtttatg cttttatttt 720 
60 gtcttttgtg gaatccggta atcagtaaac cggatcgagt aggctaatga ctttcggatt 780 
ctaatttcac cccttttgac aaactctaat cttggtgaaa ttgcatatta atctcgggct 840 
(2) INFORMATION FOR SEQ ID NO: 81: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
tii) MOLECULE TYPE: peptide 
,'ix) FEATURE: 

(A) NAME/KEY: peptide 
(*) LOCATION: 1 . . 148 

(D) OTHER INFORMATION: / Ceres Seq. ID 1442748 
ixi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 1 
His ^lu Thr lie Pro Leu Leu Arg Phe Leu His 
15 10 
Arg His Tyr lie Val Leu Ser His Ala lie Phe 

20 25 
Val Asn Glu Ser Leu Phe Ser Lys Phe Tyr Phe 

35 40 
Gin Glu Asp Val Thr Ala Val Ala Thr Asn Gly 

r j 5 5 

Thr His Leu Val Phe Thr Glu Phe Lys Gin Met 
o 7 0 7 5 

Gin Lys Val Gly Asp Ala Val Thr Phe Tyr Lys 

85 90 
lie Glu Ser Gly His Ser Leu Tyr Pro Lys Arg 

100 105 
Phe Leu Met Phe Ser Leu Leu Ser Leu lie Ser 

115 120 
Leu Phe Ala Thr Phe Pro Leu Ser Leu Val Phe 

130 135 
Lys Val Arg Glu 
145 

(2) INFORMATION FOR SEQ IP NO: 82: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
{ D) TOPOLOGY: linear 

■ii) MOLECULE TYPE: peptide 
.ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 1 0-2 

(D) OTHER INFORMATION: / Ceres Seq. ID 1442749 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



Met 


Ala 


Gin 


Glu 


Asp 


Val 


Thr 


Ala 


Val 


Ala 


Thr 


Asn 


Gly 


Ala 


Gly 


Pro 


1 








5 










10 










15 




Val 


Glu 


Thr 


His 


Leu 


Val 


Phe 


Thr 


Glu 


Phe 


Lys 


Gin 


Met 


Leu 


Leu 


Val 








20 










25 










30 






Glu 


Ala 


Gin 


Lys 


Val 


Gly 


Asp 


Ala 


Val 


Thr 


Phe 


Tyr 


Lys 


Ser 


Ala 


Phe 






35 










40 










45 








Gly 


Ala 


He 


Glu 


Ser 


Gly 


His 


Ser 


Leu 


Tyr 


Pro 


Lys 


Arg 


Asn 


Leu 


Thr 




50 










55 










60 










Lys 


Ser 


Phe 


Leu 


Met 


Phe 


Ser 


Leu 


Leu 


Ser 


Leu 


He 


Ser 


Leu 


Ala 


Leu 


65 










70 










75 










80 


Pro 


Ser 


Leu 


Phe 


Ala 


Thr 


Phe 


Pro 


Leu 


Ser 


Leu 


Val 


Phe 


Leu 


Leu 


Arg 










85 










90 










95 




Asn 


Arg 


Lys 


Val 


Arg 


Glu 























100 

(2) INFORMATION FOR SEQ ID NO: 83: 
fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



Lys Tyr Leu Thr His 
15 

He Ser Leu Leu Thr 
30 

Leu Ser Ala Met Ala 
45 

Ala Gly Pro Val Glu 

60 

Leu Leu Val Glu Ala 

8 0 

Ser Ala Phe Glv Ala 
95~ 

Asn Leu Thr Lys Ser 
110 

Leu Ala Leu Pro Ser 
125 

Leu Leu Arg Asn Arg 
140 
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(D) TOPOLOGY; linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
5 (B) LOCATION: 1 . . 74 

(D) OTHER INFORMATION: / Ceres Seq. ID 1442750 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
Met Leu Leu Val Glu Ala Gin Lys Val Gly Asp Ala Val Thr Phe Tyr 
1 5 10 15 

10 Lys Ser Ala Phe Gly Ala lie Glu Ser Gly His Ser Leu Tyr Pro Lys 

20 25 30 

Arg Asn Leu Thr Lys Ser Phe Leu Met Phe Ser Leu Leu Ser Leu lie 

35 40 45 

Ser Leu Ala Leu Pro Ser Leu Phe Ala Thr Phe Pro Leu Ser Leu Val 
15 50 55 60 

Phe Leu Leu Arg Asn Arg Lys Val Arg Glu 
65 70 
(2) INFORMATION FOR SEQ ID MO : 8 4 : 
:'i) SEQUENCE CHARACTERISTICS: 
2 0 A) LENGTH: 513 base pairs 

.3} TYPE: nucleic acid 

(C) STRANDEDNESS: single 
i D } TOPOLOGY: linear 

(ii; MOLECULE TYPE: DNA (genomic) 
2 5 (ix; FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1 . . 513 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459199 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

30 aacttggatc ttaacacaga gaagaagcac aatcggaaga aaggagaaga cgaagatgtc 60 
gttggtatgg ctggaagcga tgttgcctct cggaatcatc ggtgggatgc tctgtatcat 120 
gggcaattct cagtactaca tccacaaagc ttatcatggc cgtcctaagc acatcggcca 180 
cgatgaatgg gatgttgcta tggaaagacg cgacaagaaa gtcgtcgaga aagctgcagc 240 
tccttcctca tgattcgctt tatctctttt gtgttcctca ggtgaccact tgtggtgaca 300 

35 aataaagtgc attccagaag aagaagaagc tgggggatct agtactttca ttcccatttg 360 
attttccttg gacatattaa agctttcaga aatcagacct caataacatt tggtttatca 420 
atatttctct antcgtgata ttttgtatgc ttttaagttg catgatacac tgaactactc 480 
tcttgctgrg -gagtgaata aatgaatatc tgc 
12) INFORMATION FOR SEQ ID NO : 8 5 : 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
45 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..83 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459200 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 5 : 

Thr Trp lie Leu Thr Gin Arg Arg Ser Thr lie Gly Arg Lys Glu Lys 
1 5 10 15 

Thr Lys Met Ser Leu Val Trp Leu Glu Ala Met Leu Pro Leu Gly lie 
20 25 30 

55 lie Gly Gly Met Leu Cys lie Met Gly Asn Ser Gin Tyr Tyr lie His 
35 40 45 

Lys Ala Tyr His Gly Arg Pro Lys His lie Gly His Asp Glu Trp Asp 

50 55 60 

Val Ala Met Glu Arg Arg Asp Lys Lvs Val Val Glu Lvs Ala Ala Ala 
60 65 70" 75 " 80 

Pro Ser Ser 



WO 00/40695 



PCT/US00/00466 



632 

(2) INFORMATION FOR SEQ ID NO : 8 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
10 (B) LOCATION: 1..65 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459201 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 6 : 
Met Ser Leu Val Trp Leu GIu Ala Met Leu Pro Leu Gly lie lie Gly 
1 5 10 15 

15 Gly Met Leu Cys lie Met Gly Asn Ser Gin Tyr Tyr lie His Lys Ala 

20 25 30 

Tyr His Gly Arg Pro Lys His lie Gly His Asp Glu Trp Asp Val Ala 

3 5 4 0 4 5 

Met Glu Arg Arg Asp Lys Lys Val Val Glu Lys Ala Ala Ala Pro Ser 

2 0 5 0 5 5 60 

Ser 
65 

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

3 0 (ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 57 

(D) OTHER INFORMATION: / Ceres Seq. ID 1459202 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

35 



40 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 0 (ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..114 0 

(D) OTHER INFORMATION: / Ceres Seq. ID 1565605 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 8 : 

55 atgatttgtt tacatttttt cgtggtgttc ttgatttttt tctttgatca aaactagtag 60 
aaattttcca atttcgtcct gccttgtgat tagtagtaga tattttggaa tttgattagt 120 
aggtagaaga agatggggac gacgttagat gtatcaagag cagagctagc acttgtggta 180 
atgtatttga acaaagcaga ggcaagagat aagttatgca gagctataca gtatggttcc 240 
aagttcttga gtggtggaca acctggtact gctcaaaatg ttgacaaatc tactagctta 300 

60 gcaagaaaag tcttccgtct tttcaagttt gtgaatgact tgcatggtct tatcagtcct 360 
gtgcctaaag gaactcctct tcctcttgtt ttacttggaa agtcgaagaa cgcactttta 420 
tctacattct tgttcctgga tcaaattgtc tggcttggga gatcaggaat atataagaac 480 



Met 


Leu Pro 


Leu 


Gly 


lie 


He 


Gly Gly 


Met 


Leu 


Cys 


He 


Met 


Gly 


Asn 


1 






5 








10 










15 




Ser 


Gin Tyr 


Tyr 


lie 


His 


Lys 


Ala Tyr 


His 


Gly 


Arg 


Pro 


Lys 


His 


He 






20 








25 










30 






Gly 


His Asp 


Glu 


Trp 


Asp 


Vai 


Ala Met 


Glu 


Arg 


Arg 


Asp 


Lys 


Lys 


Val 




35 










40 








45 








Val 


Glu Lys 


Ala 


Ala 


Ala 


Pro 


Ser Ser 


















50 








55 


















(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO : 8 8 : 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 
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aaagaacgag ctgagttact tggacgtata tctctcttct gctggatggg atcttctgtc 540 
tgcacaactt tagtcgaggt tggtgagatg ggaaggcttt cttcatcaat gaagaagatc 600 
gaaaagggac tcaagaatgg aaacaagtat caggatgagg attatcgtgc taagctaaaa 660 
aaatcaaacg agaggtcact tgctttgatc aaatcagcta tggacattgt tgtagcagct 720 
ggtcttcttc agttagctcc aacgaagatc actcctcgtg tcaccggagc ttttggattc 780 
atcacctcca tcatttcttg ttaccagttg cttccgacac gccccaagat caaaacaccc 840 
tgaagtcaat cgaggaagct ggtgttaagg agaaagtcat ttcagaaagt ttctgttctt 900 
cttattatta aggaaattca ctgctgttat aaaacatcgt ataaaatact tatttgcagg 960 
aattacggtt attataaatt tatgcctcag attcttgata cacaagtttc aatactcaat 1020 
agtattgcat tattatgaga tgttgtttct Gctacgcaat aactagaggc aaaatccaac 1080 
tacatatatg ttttgatctg attattataa tagactttgg gcttcattcc gcctaaagct 1140 
(2) INFORMATION FOR SEQ ID NO : 8 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: linear 
peptide 



(ii) 
' ix} 



MOLECULE TYPE: 

FEATURE: 

(A) NAME /KEY : 
{ 5 J LOCATION: 



peptide 
1. .236 

[D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO : 8 9 : 



1565606 



Met 


Gly 


Thr 


Thr 


Leu 


Asp 


Val 


Ser 


Arg 


Ala 


Glu 


Leu 


Ala 


.Leu 


Val 


Val 


1 








5 










10 










15 




Met 


Tyr 


Leu 


Asn 


Lys 


Ala 


Glu 


Ala 


Arg 


Asp 


Lys 


Leu 


Cys 


Arg 


Ala 


He 








20 










25 " 










30 ' 






Gin 


Tyr 


Gly 


Ser 


Lys 


Phe 


Leu 


Ser 


Gly 


Gly 


Gin 


Pro 


Gly 


Thr 


Ala 


Gin 






35 










40 










45^ 








Asn 


Val 


Asp 


Lys 


Ser 


Thr 


Ser 


Leu 


Ala 


Arg 


Lys 


Val 


Phe 


Arg 


Leu 


Phe 




50 










55 










60 










Lys 


Phe 


Val 


Asn 


Asp 


Leu 


His 


Gly 


Leu 


He 


Ser 


Pro 


Val 


Pro 


Lys 


Gly 


65 










70 










75 










80 


Thr 


Pro 


Leu 


Pro 


Leu 


Val 


Leu 


Leu 


Gly 


Lys 


Ser 


Lys 


Asn 


Ala 


Leu 


Leu 










85 










90 










95 




Ser 


Thr 


Phe 


Leu 


Phe 


Leu 


Asp 


Gin 


He 


Val 


Trp 


Leu 


Gly 


Arg 


Ser 


Gly 








100 










105 










110 






He 


T vr 


Lys 


Asn 


Lys 


Glu 


Arg 


Ala 


Glu 


Leu 


Leu 


Gly 


Arg 


He 


Ser 


Leu 






115 










120 










125 








Phe 


Cys 


Trp 


Met 


Gly 


Ser 


Ser 


Val 


Cys 


Thr 


Thr 


Leu 


Val 


Glu 


Val 


Gly 




130 










135 










140 










Glu 


Met 


Gly 


Arg 


Leu 


Ser 


Ser 


Ser 


Met 


Lys 


Lys 


He 


Glu 


Lys 


Gly 


Leu 


145 










150 










155 










160 


Lys 


Asn 


Gly 


Asn 


Lys 


Tyr 


Gin 


Asp 


Glu 


Asp 


Tyr 


Arg 


Ala 


Lys 


Leu 


Lys 










165 










170 










175 




Lys 


Ser 


Asn 


Glu 


Arg 


Ser 


Leu 


Ala 


Leu 


He 


Lys 


Ser 


Ala 


Met 


Asp 


He 








180 










185 










190 






Val 


Val 


Ala 


Ala 


Gly 


Leu 


Leu 


Gin 


Leu 


Ala 


Pro 


Thr 


Lys 


He 


Thr 


Pro 






195 










200 










2 0 5 








Arg 


Val 


Thr 


Gly 


Ala 


Phe 


Gly 


Phe 


He 


Thr 


Ser 


He 


He 


Ser 


Cys 


Tyr 




210 










215 










220 










Gin 


Leu 


Leu 


Pro 


Thr 


Arg 


Pro 


Lys 


He 


Lys 


Thr 


Pro 










225 










230 










235 












(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO: 90 : 

















(i) 



(ii) 
(ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 
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(B) LOCATION: 1, .22 0 

(D) OTHER INFORMATION: ./ Ceres Seq. ID 1565607 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
Met Tyr Leu Asn Lys Ala Glu Ala Arg Asp Lys Leu Cys Arg Ala lie 
5 1 5 10 15 

Gin Tyr Gly Ser Lys Phe Leu Ser Gly Gly Gin Pro Gly Thr Ala Gin 

20 25* 30 

Asn Val Asp Lys Ser Thr Ser Leu Ala Arg Lys Val Phe Arg Leu Phe 
35 40 45 

10 Lys Phe Val Asn Asp Leu His Gly Leu lie Ser Pro Val Pro Lys Gly 
50 55 " 60 

Thr Pro Leu Pro Leu Val Leu Leu Gly Lys Ser Lys Asn Ala Leu Leu 
65 7 0 "75 80 

Ser Thr Phe Leu Phe Leu Asp Gin lie Val Trp Leu Gly Arg Ser Gly 
15 35 90 95 

lie Tyr Lys Asn Lys Glu Arg Ala Glu Leu Leu Gly Arg lie Ser Leu 

100 105 110 

Phe Cys Trp Met Gly Ser Ser Val Cys Thr Thr Leu Val Glu Val Gly 
115 120 125 

20 Glu Met Gly Arg Leu Ser Ser Ser Mer Lys Lys lie Glu Lys Gly Leu 
130 135 140 

Lys Asn Gly Asn Lys Tyr Gin Asp Glu Asp Tyr Arg Ala Lys Leu Lys 
145 150 155 160 

Lys Ser Asn Glu Arg Ser Leu Ala Leu lie Lys Ser Ala Met Asp lie 
25 165 170 175 

Val Val Ala Ala Gly Leu Leu Gin Leu Ala Pro Thr Lys lie Thr Pro 

180 185 190 

Arg Val Thr Gly Ala Phe Gly Phe lie Thr Ser lie lie Ser Cys Tyr 
195 200 205 

30 Gin Leu Leu Pro Thr Arg Pro Lys lie Lys Thr Pro 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 91: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
fix J FEATURE: 
4 0 (A) NAME /KEY : - 

(B) LOCATION: 1 . . 533 

{ D ) OTHER INFORMATION: / Ceres Seq. ID 1566686 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
atgattggtg gtttcttcgc cgatgttaaa ttcaacagat cgagagagaa aaaactctct 60 
45 ctccctcctc cgttcaccgt cgcgactcgg ctctacagta cattttgttg agcaatctcg 120 
tatttagagg attccaatgt ctggcgcaga ggataataaa tcttcccatg ctcagctttc 180 
gtcccaaatt tttcttgatc ttgtggattc ggtgatagct gatgtagcat ctgagtgtca 240 
tcgggtagca cgactaggac ttgatcgtga tttggatata gtagaagaag agttgagatt 300 
grcagtggaa gcgcgtgcga agatcgctga tcctagcaac aacctcgaaa ccaacaccaa 360 
50 atatgttgtt gatatatttg ggcagactca ccctcctgta gctagtgaag tgttcaattg 420 
catgaactgt gggcgacaaa ttgttgcagg aaggtttgct cctcatctgg aaaaatgcat 480 
gggaaaggga agaaaggctc gtgGccaagG actaccagaa gcacaacggc tgc 
(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

60 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .132 
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10 



25 



30 



35 



40 



45 



50 



55 



60 



635 

(D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
Met Ser Gly Ala Glu Asp Asn Lys Ser Ser His Ala Gin Leu Ser Ser 
15 10 15 

Gin lie Phe Leu Asp Leu Val Asp Ser Val lie Ala Asp Val Ala Ser 

20 25 30 

Glu Cys His Arg Val Ala Arg Leu Gly Leu Asp Arg Asp Leu Asp lie 

35 40 45 

Val Glu Glu Glu Leu Arg Leu Ser Val Glu Ala Arg Ala Lys lie Ala 

50 55 60 

Asp Pro Ser Asn Asn Leu Glu Thr Asn Thr Lys Tyr Val Val Asp lie 
65 70 75 80 

Phe Gly Gin Thr His Pro Pro Val Ala Ser Glu Val Phe Asn Cys Met 

85 90 95 

Asn Cys Gly A.rg Gin lie Val Ala Gly Arg Phe Ala Pro His Leu Glu 

100 105 110 

Lys Cys Met Gly Lys Gly Arg Lys Ala Arg Gly Gin Gly Leu Pro Glu 
115 120 125 

Gin Arg Leu 
130 

INFORMATION FOR SEQ ID NO: 93: 



Ala 



(2) 



(i) 



(ii) 
( ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: 
MOLECULE TYPE: 
FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



1 mear 

DNA (genomic) 



ID 1567367 



(xi) 



1 . . 8 1 5 

(D) OTHER INFORMATION: / Ceres Seq. 
SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
gaaacaagac acgaaagaat ctctaaagaa atcattttca tcgcttcgct tcgatttttc 
ttcaatggct gtcgtcggcg ctccaatatc gtctccggcg gctcagctgc agacacaatt 
tctctccaat cccattctcc cccgctttcg ccggtctttc tccaccggaa aatcaccagc 
aactttctcc gtcgtagcta tggctcccca gaaaaaggtg aacaaatatg atgccaagtg 
gaagaaacaa tggtacggag ctggartgtt tttcgaaggg agtgagcaaa taaacgttga 
tgttttcaag aagctggaga agcgaaaagt gttgagcaac gttgagaaat ctggcctgct 
gtaaaagcag aggggttggg actcacattg tcatctcttg agaagcttaa agtcttctcc 
aaagcagagg accttggtct tctcagtctc cttgagaact tagctggamG AAcatcgcct 
gcggtcttag cctcggctgc attaccagct ctcacggctg ctattgtagc cgtggtgttg 
atcccgaatg actcaactac tctagtggtt gctcaggcgg ttttggccgg tgctcttgcg 
cttacagggg ttgttttgtt ggttggttct gttgttttgg atggacttca agaagctgac 
tgattctttc tctgtaaacc aaacataaac ccatgtcttg tccaattgat ttttgtcagt 
tgctgattta tagctgtatg gttcagttgt ttatggttag tccaagacat aagctgagtg 
atagaaagaa gctttataat aattaaacaa atatt 
(2) INFORMATION FOR SEQ ID NO: 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. .76 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567368 
SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
Glu Thr Arg His Glu Arg lie Ser Lys Glu lie lie Phe lie Ala Ser 
15 10 15 

Leu Arg Phe Phe Phe Asn Gly Cys Arg Arg Arg Ser Asn lie Val Ser 
20 25 30 



(ii) 
(ix) 



(xi) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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Gly Gly Ser Ala Ala Asp Thr lie Ser Leu Gin Ser His Ser Pro Pro 

35 40 45 

Leu Ser Pro Val Phe Leu His Arg Lys lie Thr Ser Asn Phe Leu Arg 

50 55 60 

Arg Ser Tyr Gly Ser Pro Glu Lys Gly Glu Gin lie 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .12 0 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567369 
(xij SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
Lys Gin Asp Thr Lys Glu Ser Leu Lys Lys Ser Phe Ser Ser Leu Arg 
15 10 15 

Phe Asp Phe Ser Ser Met Ala Val Val Gly Ala Pro lie Ser Ser Pro 

2 0 2 5 3 0 

Ala Ala Gin Leu Gin Thr Gin Phe Leu Ser Asn Pro lie Leu Pro Arg 

35 40 45 

Phe Arg Arg Ser Phe Ser Thr Gly Lys Ser Pro Ala Thr Phe Ser Val 

5 0 5 5 60 

Val Ala Met Ala Pro Gin Lys Lys Val Asn Lys Tyr Asp Ala Lys Trp 
65 70 75 80 

Lys Lys Gin Trp Tyr Gly Ala Gly Leu Phe Phe Glu Gly Ser Glu Gin 

85 90 95 

lie Asn Val Asp Val Phe Lys Lys Leu Glu Lys Arg Lys Val Leu Ser 

100 135 110 

Asn Val Glu Lys Ser Gly Leu Leu 
115 12 0 

(2) INFORMATION FOR SEQ ID MO : 9 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 99 

(D) OTHER INFORMATION: / Ceres Seq. ID 1567370 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
Met Ala Val Val Gly Ala Pro lie Ser Ser Pro Ala Ala Gin Leu Gin 
15 10 15 

Thr Gin Phe Leu Ser Asn Pro lie Leu Pro Arg Phe Arg Arg Ser Phe 

20 25 30^ 

Ser Thr Gly Lys Ser Pro Ala Thr Phe Ser Val Val Ala Met Ala Pro 

35 40 45 

Gin Lys Lys Val Asn Lys Tyr Asp Ala Lys Trp Lys Lys Gin Trp Tyr 

50 55 60' 

Gly Ala Gly Leu Phe Phe Glu Gly Ser Glu Gin lie Asn Val Asp Val 
65 70 75 80 

Phe Lys Lys Leu Glu Lys Arg Lys Val Leu Ser Asn Val Glu Lys Ser 
85 90 95 

Gly Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 97: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 541 

(D) OTHER INFORMATION: / Ceres Seq. ID 1570101 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

aaaMctttac gttggtatta tctggtttca cattttctta accctctttg gtgaatgcgt 60 
cattgaaacg ccattatccg ccgtgctaat ctgatctcct cccccgtcgg aaaaaaiggc 120 
cactggtgga aaagtctcct tcaaagtcac tctgacttcg gatcctaagc ttcctttcaa 180 
agttttcagc gtaccagaag gagctccgtt cacggcggtt ctgaaattcg cagcagaaga 240 

15 gttcaaggtt cctccacaaa ccagcgccat catcactaat gatgggatcg ggatcaatcc 300 
tcaacagagt gcaggaaacg tttttctgaa gcacggatct gaactaagat r.gatcccgcg 360 
tgatagagtt ggagctgtgt ttgtgatgga tccatagatg atgtgtaagc aatgattaaa 420 
aactcagttc caaaaaaaaa acataataat atactgaaat gcttatgtat caactTcatt 480 
qaataaatat gtaatcatca catcctttta caaattaaag taagttaaat ttaatcaaat 540 

20 c 

2 , INFORMATION FOR SEQ ID MO: 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
30 (B) LOCATION: 1 . . 93 

(D) OTHER INFORMATION: / Ceres Seq. ID 1570102 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
Met Ala Thr Gly Gly Lys Val Ser Phe Lys Val Thr Leu Thr Ser Asp 
1 5 10 15 

3 5 Pro Lys Leu Pro Phe Lys Val Phe Ser Val Pro Glu Gly Ala Pro Phe 

20 25 30 

Thr Ala Val Leu Lys Phe Ala Ala Glu Glu Phe Lys val Pro Pro Gin 

35 40 45 

Thr Ser Ala lie lie Thr Asn Asp Gly lie Gly lie Asn Pro Gin Gin 
40 50 55 60 

Ser Ala Gly Asn Val Phe Leu Lys His Gly Ser Glu Leu Arg Leu lie 
65 70 75 80 

Pro Arg Asp Arg Val Gly Ala Val Phe Val Met A.sp Pro 
85 90 

4 5 (2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 418 

55 (D) OTHER INFORMATION: / Ceres Seq. ID 1571051 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
aaaaggaaga cgacgatgca gagtttaacg tcaaacgaag tTgcgggttt ggcagttggg 60 
gctctgcttc ttggtgccac catcgctgct cctaaagttg atgcttttat cgctgcttct 120 
cagagaagat ctcttggcat gtgtcgaaaa tgtggagatc ttaagaatgt agcttgtggc 180 

60 cgttgcaaag gaacaggaac aatcaaatca ggaggattct ttggtttcag tgactcatca 240 
aacacaagat cagtggcttg cgataattgc caagccaaag gttgtttccc ttgccctgaa 300 
tgctcaaaat cttgaccatt ttctcggtat tttatagttg tttcatcttc ttgacactat 360 
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gataagtgta atcggtccat tggtaatggt aatgttaaag ttgaagaatg tcttgttt 
(2) INFORMATION FOR SEQ ID NO: 100: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
;ix) FEATURE: 

(A) NAME /KEY : peptide 

(3) LOCATION: 1 . . 104 

( D) OTHER INFORMATION: / Ceres Seq. ID 1571052 
i'xi) SEQUENCE DESCRIPTION: SEQ ID NO : 100: 



lvs 


Arg 


Lys 


Thr 


Thr 


Met 


Gin 


Ser 


Leu 


Thr 


Ser 


Asn 


Glu 


Val 


Ala 


Gly 










5 










10 










15 




Leu 


r--J_ a 


V a 1 


Gly 


Ala 


Leu 


Leu 


Leu 


Gly 


Ala 


Thr 


He 


i^j. a 


Ala 


Pro 


Lys 








2 0 










25^ 










30 






Val 


Asp 


Ala 


Fhe 


He 


Ala 


Ala 


Ser 


Gin 


Arg 


Arg 


Ser 


Leu 


Gly 


Met 


C*'S 






35 










40 


















Arg 


~_ y s 


Cys 


Gly 


Asp 


Leu 


Lys 


Asn 


Val 


Ala 


y s 


GI v 


Arg 


Cys 


Lys 


Gly 


Thr 


Gly 


Thr 


lie 


L y s 


Ser 


Gly 


Giy 


Phe 


Phe 


■o _l y 


Phe 


Ser 


Asp 


Ser 


Ser 


65 










70 










"7 £ 










80 


Asn 


Thr 


Arg 


Ser 


Val 


Ala 


Cys 


Asp 


Asn 


Cys 


Gin 


Al a 


Lys 


Gly 


Cys 


Phe 










85 










90 










95 




Pro 


Cys 


Pro 


Glu 


Cys 


Ser 


Lys 


Ser 
























100 


























/ n \ 
V ^ / 


INFORMATION 


FOR 


SEQ 


ID 


NO: 101 : 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
fix) FEATURE: 

(A) NAME /KEY : peptide 

(3) LOCATION: 1 . . 99 

( D } OTHER INFORMATION: / Ceres Seq. ID 1571053 
i'xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



Met 


Gin 


Ser 


ueu 


Thr 


Ser 


Asn 


Glu 


Val 


Ala 


Gly 


Leu 


Ala 


Val 


Gly 


Ala 


1 








5 










10 










15 




Leu 


Leu 


Leu 


Gly 


Ala 


Thr 


He 


Ala 


Ala 


Pro 


Lys 


Val 


Asp 


Ala 


Phe 


lie 








20 










25 










30 






Ala 


Ala 


Ser 


Gin 


Arg 


Arg 


Ser 


Leu 


Gly 


Met 


Cys 


Arg 


Lvs 


Cys 


Gly 


Asp 






35 










40 










4 5 








Leu 


Lys 


Asn 


Val 


Ala 


Cys 


Gly 


Arg 


Cys 


Lys 


Gly 


Thr 


Gly 


Thr 


He 


Lys 




50 










55 










60 










Ser 


Gly 


Gly 


Phe 


Phe 


Gly 


Phe 


Ser 


Asp 


Ser 


Ser 


Asn 


Thr 


Arg 


Ser 


Val 


65 










70 










75 










80 


Ala 


Cys 


Asp 


Asn 


Cys 


Gin 


Ala 


Lys 


Gly 


Cys 


Phe 


Pro 


Cys 


Pro 


Glu 


Cys 



85 90 95 

Ser Lys Ser 

(2) INFORMATION FOR SEQ ID NO: 102: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
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(B) LOCATION: 1 . . 58 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571054 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
Met Cys Arg Lys Cys Gly Asp Leu Lys Asn Val Ala Cys Gly Arg Cys 
5 1 5 10 15 

Lys Gly Thr Gly Thr He Lys Ser Gly Gly Phe Phe Gly Phe Ser Asp 

20 25 30 

Ser Ser Asn Thr Arg Ser Val Ala Cys Asp Asn Cys Gin Ala Lys Gly 
35 40 45 

10 Cys Phe Pro Cys Pro Glu Cys Ser Lys Ser 
50 55 
(2) INFORMATION FOR SEQ ID NO: 103: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
fix) FEATURE : 
2 0 (A) NAME/ KEY : - 

(3) LOCATION: 1 . . 444 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571100 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
aatctacttt tgagttctct ggagcagctt tggtcttttc tctcttcagt tacagctcgc 60 
25 gatccagatc catatccagt catggcgggt gtagtagcaa gaattgtgga aaagtttggt 120 
aaacgtgata tttggtatcg aatatcgttg atatcaattc cagtctctag tgtcctaata 180 
gtactaggga atatggagct tgctcgtatc agagaggaga agaagctttt gaaggagagg 240 
tttgatcaac tgagggctag aggaataatc agagatcagt aatccaaaag tgtgtttagg 300 
ctttgttttt ttgtttcttt tct.atgaatt taatttttta tttctccttt ccatgaatta 360 
30 accaactcta tgtaacgaga tgatttattc ttgtctctct aagactcgag atgattcgct 420 
attgataaaa cttctattat tScc 
(2) INFORMATION FOR SEQ ID NO: 104* 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 0 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 93 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571101 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Asn 

45 



50 



Asn 


Leu 


Leu 


Leu 


Ser 


Ser 


Leu 


Glu 


Gin 


Leu 


Trp 


Ser 


Phe 


Leu 


Ser 


Ser 


1 








5 










10 










15 




Val 


Thr 


Ala 


Arg 


Asp 


Pro 


Asp 


Pro 


Tyr 


Pro 


Val 


Met 


Ala 


Gly 


Val 


Val 








20 










25 










30 






Ala 


Arg 


He 


Val 


Glu 


Lys 


Phe 


Gly 


Lys 


Arg 


Asp 


He 


Trp 


Tyr 


Arg 


He 






35 










40 










45 








Ser 


Leu 


He 


Ser 


He 


Pro 


Val 


Ser 


Ser 


Val 


Leu 


He 


Val 


Leu 


Gly 


Asn 




50 










55 










60 










Met 


Glu 


Leu 


Ala 


Arg 


He 


Arg 


Glu 


Glu 


Lys 


Lys 


Leu 


Leu 


Lys 


Glu 


Arg 


65 










70 










75 










80^ 


Phe 


Asp 


Gin 


Leu 


Arg 


Ala 


Arg 


Gly 


He 


He 


Arg 


Asp 


Gin 









55 85 90 

(2) INFORMATION FOR SEQ ID NO: 105: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
60 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..66 

(D) OTHER INFORMATION: / Ceres Seq. ID 1571102 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Met Ala Gly Val Val Ala Arg lie Val Glu Lys Phe Gly Lys Arg Asp 
15 10 15 

lie Trp Tyr Arg lie Ser Leu lie Ser lie Pro Val Ser Ser Val Leu 
20 25 30 

10 lie Val Leu Gly Asn Met Glu Leu Ala Arg lie Arg Glu Glu Lys Lys 
35 40 45 

Leu Leu Lys Glu Arg Phe Asp Gin Leu Arg Ala Arg Gly lie lie Arg 

50 " 5 5 60 

Asp Gin 
15 65 

(2) INFORMATION FOR SEQ ID NO: 10 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

( D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DMA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
25 (B) LOCATION: 1..636 

(D) OTHER INFORMATION: / Ceres Seq. ID 1665272 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
cagaataaaa atctatttct tgtgtttgtg ttatatttac ttaaaaaaat aaaggagatc 60 
agtttgggag aaagcagcaa agaagaaaaa tggcagcaac atttgcaaca ccatcgacgg 120 
30 tgataggcct cggaggatca tccatcacca ccaaaccctt ctcttcatcc tttttaaaac 180 
caacattaag cgccaagaac cctttgagac tcgccggtgc atcgggagga agagtcactt 240 
gctttgagag gaactggttg aggagagatt tgaacgtggt aggatttggg ctgatcggat 300 
ggctagctcc gtcgagcatt ccagcgataa atgggaagag cctgacgggt ctcttcttcg 360 
atagcatcgg aactgagctc gctcacttcc cgactcctcc agctctcact tcacagttct 420 
35 ggttgtggtt ggttacgtgg cacttaggcc tcttcctctg cctcactttc ggacaaatcg 480 
gattcaaggg caggactgag gattacttct aaggataact attcttgttt tcgtttgtac 540 
tatatgctct ctcttggtta tgtgtaatat tatcaatcaa aacaaagctt tctttgctat 600 
ttgatgtttc attctatatc tcgacaAgtc tttctt 
(2) INFORMATION FOR SEQ ID NO: 107: 
4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
45 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . . 140 

(D) OTHER INFORMATION: / Ceres Seq. ID 1665273 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Met Ala Ala Thr Phe Ala Thr Pro Ser Thr Val lie Gly Leu Gly Gly 
15 10 15 

Ser Ser lie Thr Thr Lys Pro Phe Ser Ser Ser Phe Leu Lys Pro Thr 
20 25 30 

55 Leu Ser Ala Lys Asn Pro Leu Arg Leu Ala Gly Ala Ser Gly Gly Arg 
35 40 45 

Val Thr Cys Phe Glu Arg Asn Trp Leu Arg Arg Asp Leu Asn Val Val 

50 55 60 

Glv Phe Gly Leu lie Gly Trp Leu Ala Pro Ser Ser lie Pro Ala lie 
60 65 70 75 80 

Asn Gly Lys Ser Leu Thr Gly Leu Phe Phe Asp Ser lie Gly Thr Glu 
85 90 95 
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Leu Ala His Phe Pro Thr Pro Pro Ala Leu Thr Ser Gin Phe Trp Leu 

100 105 110 

Trp Leu Val Thr Trp His Leu Gly Leu Phe Leu Cys Leu Thr Phe Gly 

115 120 125 

Gin lie Gly Phe Lys Gly Arg Thr Glu Asp Tyr Phe 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 108: 
(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: DNA (genomic) 
(ixi FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 522 

(D) OTHER INFORMATION: / Ceres Seq. ID 1713895 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



aaacttggat cttaacacag agaagaagca caatcggaag aaaggagaag acgaagatgt 60 

cgtt ggr arg gctggaagcg atgttgcctc tcggaarcat cggtgggatg ctctgtatca 120 

tgggcaattc tcagtactac arccacaaag cttatcatgg ccgtcctaag cacatcggcc 180 

acgatgaatg ggargttgct atggaaagac gcgacaagaa agtcgtcgag aaagctgcag 240 

ctccttcctc atgattcgct ttatctcttt tgtgttcctc aggggcttaa ggtgaccact 300 

tgtggtgaca aataaagtgc attccagaag aagaagaagc tgggggatct agtactttca 360 

ttcccatttg attttccttg gacatattaa agctttcaga aatcagacct caataacatt 420 

tggtttatca atatttctct attcgtgata ttttgtatgc ttttaagttg catgatacac 480 



tgaactactc tcttgctgtg tgagtgaata aatgaatatc tg 
(2) INFORMATION FOR SEQ ID NO: 109: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 83 

(D) OTHER INFORMATION: / Ceres Seq. ID 1713896 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



Thr 


Trp 


lie 


Leu 


Thr 


Gin 


Arg 


Arg 


Ser 


Thr 


He 


Gly 


Arg 


Lys 


Glu 


Lys 
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10 










15 
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Lys 


Met 
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Leu 


Val 


Trp 


Leu 


Glu 


Ala 


Met 


Leu 


Pro 


Leu 


Gly 


lie 








20 










25 










30 






lie 


Gly 


Gly 


Met 


Leu 


Cys 


He 


Met 


Gly 


Asn 


Ser 


Gin 


Tyr 


Tyr 


He 


His 






35 










40 










45 








Lys 


Ala 


Tyr 


His 


Gly 


Arg 


Pro 


Lys 


His 


He 


Gly 


His 


Asp 


Glu 


Trp 


Asp 




50 










55 










60 










Val 


Ala 


Met 


Glu 


Arg 


Arg 


Asp 


Lys 


Lys 


Val 


Val 


Glu 


Lys 


Ala 


Ala 


Ala 



65 70 75 80 

Pro Ser Ser 

(2) INFORMATION FOR SEQ ID NO: 110: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 65 

(D) OTHER INFORMATION: / Ceres Seq. ID 1713897 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
Met Ser Leu Val Trp Leu Glu Ala Met Leu Pro Leu Gly lie lie Gly 
15 10 15 

Gly Met Leu Cys He Met Gly Asn Ser Gin Tyr Tyr He His Lys Ala 

20 25 30 

Tyr His Gly Arg Pro Lys His He Gly His Asp Glu Trp Asp Val Ala 

3 5 4 0 4 5 

Met Glu Arg Arg Asp Lys Lys Val Val Glu Lys Ala Ala Ala Pro Ser 
50 55 60 

Ser 
65 

(2) INFORMATION FOR SEQ ID NO : 1 1 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
fix) FEATURE: 

' A } NAME /KEY : peptide 
•;B) LOCATION: 1 . . 5" 

(D) OTHER INFORMATION: / Ceres Seq. ID 1713898 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 111: 
Met Leu Pro Leu Gly lie He Gly Gly Met Leu Cys He Met Gly Asn 
1 5 10 15 

Ser Gin Tyr Tyr He His Lys Ala Tyr His Gly Arg Pro Lys His lie 

20 25 30 

Gly His A.sp Glu Trp Asp Val Ala Met Glu Arg A.rg A.sp Lys Lys Val 

35 40 45 

Val Glu Lys Ala Ala Ala Pro Ser Ser 

50 55 
(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .74 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 1923752 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
acgcttagga tttaaggact atcgttcttg gttgtggatt ctcgcccacc gtcttgctta 60 
cgagtcacac accaccgtcc ttggatcgag tcttcaactt aagctctgat gctaattcac 120 
atcaaagctg acttgctctg actttgtctc tcaaggattt gaggcgtgat tcaagaacaa 180 
aacaaattaa agaacaactt ttctttattt atttagaaac tactcaaaac taaatctctc 240 
taggtcttta gaacaactct ctaaagcctc tcaatctctt tcttgagact tatggaacaa 300 
ctctccatag tctctctata tagaactgag ctacccctaa gtctatatct catgaagtct 360 
ctctcgactc gcatgtttat ctcatgaggt ctctctcgac tctaaggaga tctctcttga 420 
ctcacatctt gagctctctc tttgatcctc ttggatgctt tagctctctt gtgttatctt 480 
cttcttcttc atagagatca tatatatact aggaccaaac acctcctttc cctattgtga 540 
ataggaatac ttacaaatct tatccaactt gacattattt gtatttcttt ttttcttgat 600 
ttcctttctt gaatttcttt tcctgtatct cttctagcat cttctagaag gagataactt 660 
gctcctcaag taagttgtaa ttatccaaca cgcatgtgat ctttaatgat cgtggggatt 720 
atttagtggc agagatgaat actct 
(2) INFORMATION FOR SEQ ID NO: 113: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: / Ceres Seq. ID 1923753 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
Arg Leu Gly Phe Lys Asp Tyr Arg Ser Trp Leu Trp lie Leu Ala His 
15 10 15 

Arg Leu Ala Tyr Glu Ser His Thr Thr Val Leu Gly Ser Ser Leu Gin 
20 25 30 

Leu Lys Leu 
35 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
■(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1. .31 

(D) OTHER INFORMATION: / Ceres Seq. ID 1923754 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
Met Glu Gin Leu Ser lie Val Ser Leu Tyr Arg Thr Glu Leu Pro Leu 
1 5 10 15 

Ser Leu Tyr Leu Met Lys Ser Leu Ser Thr Arg Met Phe lie Ser 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 115: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1..36 

(D) OTHER INFORMATION: / Ceres Seq. ID 1923755 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
Met Arg Ser Leu Ser Thr Leu Arg Arg Ser Leu Leu Thr His lie Leu 
1 5 10 15 

Ser Ser Leu Phe Asp Pro Leu Gly Cys Phe Ser Ser Leu Val Leu Ser 

20 25 30 

Ser Ser Ser Ser 
35 

(2) INFORMATION FOR SEQ ID NO : 1 1 6 : 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STPANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 487 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976816 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 6 : 
aaaccctagc aaaaacctct ctctctctct caattcactt ctcctcctac aatggcttgg 60 
cgcaacgcag gatctgctgc tcgttctttc gtctccgcca ccgcaagatc accatctctc 120 
cgttctccca ccacggcgct tcctcgcctc cgtcctcctc aatcctcctt acctagccgt 180 
cgcttcacct tctcgtcacc ttccaggaat ctaggagccc taggttgcac acagtcgttc 240 



WO 00/40695 



PCT/US00/00466 



644 

ttgcctctgt acagtgttgt ggctacttct caactgacat ctcaccttaa cgttaatttg 300 
cgagctttct gcgagctgtc taacggtact tgaagaggca aaagagtgta atttcactgc 360 
gcatgcgtga atcagtactt tgaagggatg tgagatcgaa attgtgggat tacctcagca 420 
aaaacttgct taatgtgttg cctgttttag aagttttagt ttcacaatgt gtggatttat 480 
5 tagttgt 

(2) INFORMATION FOR SEQ ID NO: 117: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
15 (B) LOCATION: 1 . . 70 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976817 



20 





(xi) 


SEQUENCE DESCRIPTION: SEQ 


ID NO:117 












Asn 


Pro 


Ser 


Lys 


Asn 


Leu 


Ser Leu Ser 


Leu Asn 


Ser 


Leu 


Leu 


Leu 


Leu 


1 








5 






10 








15 




Gin 


Trp 


Leu 


Gly 


Ala 


Thr 


Gin Asp Leu 


Leu Leu 


Val 


Leu 


Ser 


Ser 


Pro 








20 






25 








30 






Pro 


Pro 


Gin 


Asp 


His 


His 


Leu Ser Val 


Leu Pro 


Pro 


Arg 


Arg 


Phe 


Leu 






35 








40 






45 








Ala 


Ser 


Val 


Leu 


Leu 


Asn 


Pro Pro Tyr 


Leu Ala 


Val 


Ala 


Ser 


Pro 


Ser 




50 










55 




60 










Arg 


His 


Leu 


Pro 


Gly 


He 
















65 










70 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 118 : 















25 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

3 5 (ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 57 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976818 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
40 Thr Leu Ala Lys Thr Ser Leu Ser Leu Ser lie His Phe Ser Ser Tyr 
15 10 15 

Asn Gly Leu Ala Gin Arg Arg lie Cys Cys Ser Phe Phe Arg Leu Arg 

20 25 30 

His Arg Lys lie Thr lie Ser Pro Phe Ser His His Gly Ala Ser Ser 
45 35 40 45 

Pro Pro Ser Ser Ser lie Leu Leu Thr 

50 55 
(2) INFORMATION FOR SEQ ID NO: 119: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 5 (ix) FEATURE: 

(A) NAME /KEY : peptide 
<B) LOCATION: 1 . . 93 

(D) OTHER INFORMATION: / Ceres Seq. ID 1976819 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
60 Met Ala Trp Arg Asn Ala Gly Ser Ala Ala Arg Ser Phe Val Ser Ala 
15 10 15 

Thr Ala Arg Ser Pro Ser Leu Arg Ser Pro Thr Thr Ala Leu Pro Arg 
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20 25 30 

Leu Arg Pro Pro Gin Ser Ser Leu Pro Ser Arg Arg Phe Thr Phe Ser 

35 40 45 

Ser Pro Ser Arg Asn Leu Gly Ala Leu Gly Cys Thr Gin Ser Phe Leu 
5 50 55 60 

Pro Leu Tyr Ser Val Val Ala Thr Ser Gin Leu Thr Ser His Leu Asn 
65 70 75 80 

Val Asn Leu Arg Ala Phe Cys Glu Leu Ser Asn Gly Thr 
85 90 
10 (2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY : - 

(B) LOCATION: 1 . . 566 

20 (D) OTHER INFORMATION: / Ceres Seq. ID 2025128 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
atgaatcttc ttcgcttcgg ctccgcctct tattctcgag aaagctcatc ttqcttcrca 60 
catctaaaca aatatctcat ctcttctatt ctactttatc act teat etc cgtacgctct 120 
tctttcttcg ttgettctte ttttgattta taaattggat ccaaaacatc ataggctctc 180 
25 gtcatcctta tatacaagca ttgttacttt gtgatataaa gaacgaagaa ttgtacgaag 240 
gcaacgagca agatgagtag caaggegaat gaggtttaaa agagtagcaa ggcgaatgaa 300 
gttgaaagga ggaagatgaa tagcacggcc aatgacgatg aaaggatgaa taggatcatc 360 
agagatggee gtcgtcgtta cgtcacctgg agccgcagcc gcaacccctc gqcaqctcat 420 
atcttttggt caaaacccta atattttcta gggttgggct attagttttt gttttgggct 480 
30 tttttgaatt gtattttggg ctttttatta atttgtagtg ggtaatttgt ttggattagg 540 
atttttggta gtgtaatttt tttttt 
(2) INFORMATION FOR SEQ ID NO: 121: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 
35 (B) TYPE: amino acid 

tC) STRANDEDNESS: 

( D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 0 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 50 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025129 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
Met Asn Leu Leu Arg Phe Gly Ser Ala Ser Tyr Ser Arg Glu Ser Ser 
45 1 5 10 15 

Ser Cys Phe Ser His Leu Asn Lys Tyr Leu lie Ser Ser lie Leu Leu 

20 25 30 

Tyr His Phe lie Ser Val Arg Ser Ser Phe Phe Val Ala Ser Ser Phe 
35 40 45 

5 0 Asp Leu 

50 

(2) INFORMATION FOR SEQ ID NO: 122: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 
55 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

60 (A) NAME/KEY : peptide 

(B) LOCATION: 1 . . 47 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025130 
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20 



25 



30 



35 



40 



45 



50 



55 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
Glu Ser Ser Ser Leu Arg Leu Arg Leu Leu Phe Ser Arg Lys Leu lie 
1 5 10 15 

Leu Leu Leu Thr Ser Lys Gin lie Ser His Leu Phe Tyr Ser Thr Leu 

20 25 " 30 

Ser Leu His Leu Arg Thr Leu Phe Phe Leu Arg Cys Phe Phe Phe 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 123: 
(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: peptide 

(B) LOCATION: 1 . . 41 

(D) OTHER INFORMATION: / 
(xi!; SEQUENCE DESCRIPTION: SEQ 



Ser Thr A 



Asp Gl 
Ala Al 



(2} 



Arg Arg 

20 
His lie 
35 

N FORMAT I ON 



Asn Asp Asp 
Arcr Tvr Val Thr 



Ceres Seq 
ID MO: 123: 
Glu Arg Met Asn 
10 



ID 2025131 



Phe Trp Ser 



Trp 
25 
Pro 



Ser 



lie He 
1 5 

Asn Pro 
30 



Ser 



FOR SEQ 



Lys 
40 
NO : 1 2 4 : 



(ii) 
(ix) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



DNA (genomic) 



1 . . 5 4 5 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025402 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
tttttttttt ttttgcgaaa ttgaaatgat ctcacataaa aactcaattg ataattttac 
aaaggagaaa aatttgtcaa tcaaagaaga agagaaagat tgaactttta acctcacttg 
gtcataactt caacgccatt ctttccagcg ataatgacag aagtagccat tccgagaaac 
agaccatgct ccaccactcc ttgaaacttc ccaatctctt tagccgcagc gaatccatcc 
ttcaaaggag tcttaaaata caaatcaata atgtaattac tgttatcagt cacataaggc 
ttgccatcac catcaactct aagctttgat tcacatccaa attccttgaa gaggtcttgc 
aatctaatca aattaaagtt ccagcaqaat tqaacaactt ccaccggcat agctaatcca 
cttccaccga gtcctgtaac gagtttggta tcatcagcca caacaataaa cttgtcagcc 
acagcttcca ccattaccta gaacatccta atcgaaaaat tatccaatcc aatccattac 
ttact 

(2) INFORMATION FOR SEQ ID NO: 12 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
peptide 



60 
120 
180 
240 
300 
360 
420 
480 
540 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



Met 
1 

Ser 



peptide 
1 . . 47 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025403 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Leu His His Ser Leu Lys Leu Pro Asn Leu Phe Ser Arg Ser Glu 

5 10 15 

lie Leu Gin Arg Ser Leu Lys lie Gin lie Asn Asn Val lie Thr 



WO 00/40695 



PCT/US00/00466 



10 



15 
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20 25 30 

Val lie Ser His lie Arg Leu Ala lie Thr lie Asn Ser Lys Leu 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 12 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 33 

( D ) OTHER INFORMATION: / Ceres Seq. ID 2025404 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
Phe Phe Phe Phe Cys Glu lie Glu Met lie Ser His Lys Asn Ser lie 
15 10 15 

Asp Asn Phe Thr Lys Glu Lys Asn Leu Ser lie Lys Glu Glu Glu Lys 
20 25 30. 

Asp 



25 



30 



35 



40 



45 



50 



55 



60 



\2) INFORMATION FOR SEQ ID NO: 127: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 1954 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025479 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

atggaacctt atataaccag aagagataca gttaatcaag tgttacatgc agaagaagaa 60 

gaagaataca atgattgtga attggatcaa ctcttccttg tgcattctga tattagatct 120 

gttcttctcc agattgatga gcttgttgtt gaagccacaa agcgtaagac cgtgagcaag 180 

cacgggttaa tagaagttga atctttcaga actgtgttgt ctgatatgct ttcctcacta 240 

aagccatggt ttcctcgatt gcaagaagcg atgtctgatt ttcagttgct tccagaggat 300 

caggaagaac agtctttgat gagcacaaac gaagaagaag atttgtttga tgttgagagt 360 

cctgagccta ctcaatttga gcccttagtc tctccttctc ctcttgtgca ttggcgtggt 420 

gatcataacg cagacaaggg tagacaactc tttcttttga caccactacc cttaggaaaa 480 

tctgaattcc taaaacacca gaatgcatct aagcttacag ctaaaaggat ttttccagac 540 

actgttgcaa atgaaccgct tgaagcttct aaagaaacaa gtgatgatgt tttaggaggt 600 

gagtcgttga aaactgcagg acttggtaaa tctttagtcc atgccatgga tttctctgaa 660 

aatttggttg agtataagcc atgttcttca ccagttctca ggagaaagat tctgtctgag 720 

cttctaatga caccttgctt gaagctgtcg cctccgaaat cgtgtacaat gtttaaaccg 780 

gttcctgaat cttctcaact gggaaagcaa ggggcttgta agtctacctg ctctgaattg 840 

ggatcatctg gtattgaaaa aacagacaat ctctgctcga agtatcctga gctcttggga 900 

atacaacatg ctccgataac tagaaaaacg gatcttgaat cctcgccggt ttggtggttt 960 

tctcctccta aaacatgtgt tttaatggag cctgtgaatg agaagaaacc aattgatgaa 1020 

actggtggat cttttgatgt tccaaacatt ataccagaag caaaacacac tacagaggga 1080 

agcatgtcaa tggtggttga gagtactcca ttgtttaaag aaccagagag cataatgacg 1140 

agaaacagaa caaaagcagg tgagagtact ctgaagaaag agctatggac aagatttgaa 1200 

gaagcaacga ttcatgacag tcggttcaac tcgatgacaa caacaacaac agtgagagga 1260 

aataataaga aatgtttcat ggaaatgtta gaagaagtga gtggcaatga ggaagatcat 1320 

gagttaagtg gcgagtttgt ggctctaaaa ccttatcaag tactcatatc ggaactgagc 1380 

ataacaaacg cttttcattg tctgtctact cttattagct gctacataac tcaaagtaac 1440 

aaacatacac atataacata tagaaagatc gaaaacaaaa gatcgggatg agtatggctc 1500 

caaagacctc aactacactt gctttattcc ttgtgaccaa tattctcttc ctcaacctca 1560 

ttaccctgag ttgcgcagac aatacttgcc caagagacgt tctcaaactt tcgacatgct 1620 

cgaatgttct caacctcatc aacttgaagc tcggggcacc agctatgagg ccttgttgct 1680 

ctattctctt tggtctaatt gatctcgatg ttgcggtttg cctttgcacc gcgctcaagc 1740 
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tcagccttct tggcatcacc atcgacactc ctattcacct taacttggct cttaacgcct 1800 
gtggaggtac ccttcctgat ggattccgtt gcccaacata gctacaatat attcatattt 1860 
tctatctatg ttcttgaatt ggtttTgtct aacatccaaa ttcaaaccta aattcatgta 1920 
taagcaaata acaagtaaaa ttaagacttt tatg 
5 (2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 496 

15 (D) OTHER INFORMATION: / Ceres Seq. ID 2025480 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
Met Glu Pro Tyr lie Thr Arq Arg Asp Thr Val Asn Gin Val Leu His 
1 5 ' 10 15 

Ala Glu Glu Glu Glu Glu Tyr Asn Asp Cvs Glu Leu Asp Gin Leu Phe 
20 20 25~ " 30 

Leu Val His Ser Asp lie Arg Ser val Leu Leu Gin lie Asp Glu Leu 

35 40 45 

Val Val Glu Ala Thr Lys Arg Lys Thr Val Ser Lys His Gly Leu lie 
50 55 60 

25 Glu Val Glu Ser Phe Arg Thr Val Leu Ser Asp Met Leu Ser Ser Leu 
65 70 75 8C 

Lys Pro Trp Phe Pro Arg Leu Gin Glu Ala Met Ser Asp Phe Gin Leu 

85 90 95 

Leu Pro Glu Asp Gin Glu Glu Gin Ser Leu Met Ser Thr Asn Glu Glu 
30 100 105 110 

Glu Asp Leu Phe Asp Val Glu Ser Pro Glu Pro Thr Gin Phe Glu Pro 

115 120 125 

Leu Val Ser Pro Ser Pro Leu Val His Trp Arg Gly Asp His Asn Ala 
130 135 140 

35 Asp Lys Gly Arg Gin Leu Phe Leu Leu Thr Pro Leu Pro Leu Gly Lys 
145 150 155 160 

Ser Glu Phe Leu Lys His Gin Asn Ala Ser Lys Leu Thr Ala Lys Arg 

165 170 175 

lie Phe Pro Asp Thr Val Ala Asn Glu Pro Leu Glu Ala Ser Lys Glu 
40 180 185 190 

Thr Ser Asp Asp Val Leu Gly Gly Glu Ser Leu Lys Thr Ala Gly Leu 

195 200 205 

Gly Lys Ser Leu Val His Ala Met Asp Phe Ser Glu Asn Leu Val Glu 
210 215 220 

4 5 Tyr Lys Pro Cys Ser Ser Pro Val Leu Arg Arg Lys lie Leu Ser Glu 
225 230 235 240 

Leu Leu Met Thr Pro Cys Leu Lys Leu Ser Pro Pro Lys Ser Cys Thr 

245 ~" 250 255 

Met Phe Lys Pro Val Pro Glu Ser Ser Gin Leu Gly Lys Gin Gly Ala 
50 260 265 270 

Cys Lys Ser Thr Cys Ser Glu Leu Gly Ser Ser Gly lie Glu Lys Thr 

275 280 285 

Asp Asn Leu Cys Ser Lys Tyr Pro Glu Leu Leu Gly lie Gin His Ala 
290 ' 295 300 

55 Pro lie Thr Arg Lys Thr Asp Leu Glu Ser Ser Pro Val Trp Trp Phe 
305 310 315 ' 320 

Ser Pro Pro Lys Thr Cys Val Leu Met Glu Pro Val Asn Glu Lys Lys 

325 330 335 

Pro lie Asp Glu Thr Gly Gly Ser Phe Asp Val Pro Asn lie lie Pro 
60 340 ' 345 350 

Glu Ala Lys His Thr Thr Glu Gly Ser Met Ser Met Val Val Glu Ser 
355 360 365 
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Thr Pro Leu Phe Lys Glu Pro Glu Ser 

370 375 
Lys Ala Gly Glu Ser Thr Leu Lys Lys 
385 390 
Glu Ala Thr lie His Asp Ser Arg Phe 
4 05 

Thr Val Arg Gly Asn Asn Lys Lys Cys 
420 425 
Val Ser Gly Asn Glu Glu Asp His Glu 

4 35 44 0 

Leu Lys Pro Tyr Gin Val Leu lie Ser 

450 455 
Phe His Cys Leu Ser Thr Leu lie Ser 
465 470 
Lys His Thr His lie Thr Tyr Arg Lys 
485 

(2) INFORMATION FOR SEQ ID NO: 129: 
(i) SEQUENCE CHARACTERISTICS: 
I A) LENGTH: 421 amino aci 
(B) TYPE: amino acid 
iC) S T RAN DE DNE S S : 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1. . 421 
(D) OTHER INFORMATION: / Ceres Seq. ID 2025481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



Met 


Leu 


Ser 


Ser 


Leu 


Lys 


Pro 


Trp 


Phe 


Pro 


Arg 


Leu 


Gin 


Glu 


Ala 


Met 


1 








5 










10 










15 




Ser 


Asp 


Phe 


Gin 


Leu 


Leu 


Pro 


Glu 


Asp 


Gin 


Glu 


Glu 


Gin 


Ser 


Leu 


Met 








20 










25 










30 






Ser 


Thr 


Asn 


Glu 


Glu 


Glu 


Asp 


Leu 


Phe 


Asp 


Val 


Glu 


Ser 


Pro 


Glu 


Pro 






35 










40 










45 








Thr 


Gin 


Phe 


Glu 


Pro 


Leu 


Val 


Ser 


Pro 


Ser 


Pro 


Leu 


Val 


His 


Trp 


Arg 




50 










55 










60 










Gly 


Asp 


His 


Asn 


Ala 


Asp 


Lys 


Gly 


Arg 


Gin 


Leu 


Phe 


Leu 


Leu 


Thr 


Pro 


65 










7 0 










75 










80 


Leu 


Pro 


Leu 


Gly 


Lys 


Ser 


Glu 


Phe 


Leu 


Lys 


His 


Gin 


Asn 


Ala 


Ser 


Lys 










85 










90 










95 




Leu 


Thr 


Ala 


Lys 


Arg 


He 


Phe 


Pro 


Asp 


Thr 


Val 


Ala 


Asn 


Glu 


Pro 


Leu 








100 










105 










110 






Glu 


Ala 


Ser 


Lys 


Glu 


Thr 


Ser 


Asp 


Asp 


Val 


Leu 


Gly 


Gly 


Glu 


Ser 


Leu 






115 










120 










125 








Lys 


Thr 


Ala 


Gly 


Leu 


Gly 


Lys 


Ser 


Leu 


Val 


His 


Ala 


Met 


Asp 


Phe 


Ser 




130 










135 










140 










Glu 


Asn 


Leu 


Val 


Glu 


Tyr 


Lys 


Pro 


Cys 


Ser 


Ser 


Pro 


Val 


Leu 


Arg 


Arg 


145 










150 










155 










160 


Lys 


He 


Leu 


Ser 


Glu 


Leu 


Leu 


Met 


Thr 


Pro 


Cys 


Leu 


Lys 


Leu 


Ser 


Pro 










165 










170 










175 




Pro 


Lys 


Ser 


Cys 


Thr 


Met 


Phe 


Lys 


Pro 


Val 


Pro 


Glu 


Ser 


Ser 


Gin 


Leu 








180 










185 










190 






Gly 


Lys 


Gin 


Gly 


Ala 


Cys 


Lys 


Ser 


Thr 


Cys 


Ser 


Glu 


Leu 


Gly 


Ser 


Ser 






195 










200 










205 








Gly 


He 


Glu 


Lys 


Thr 


Asp 


Asn 


Leu 


Cys 


Ser 


Lys 


Tyr 


Pro 


Glu 


Leu 


Leu 




210 










215 










220 










Gly 


He 


Gin 


His 


Ala 


Pro 


He 


Thr 


Arg 


Lys 


Thr 


Asp 


Leu 


Glu 


Ser 


Ser 


225 










230 










235 










240 


Pro 


Val 


Trp 


Trp 


Phe 


Ser 


Pro 


Pro 


Lys 


Thr 


Cys 


Val 


Leu 


Met 


Glu 


Pro 










245 










250 










255 




Val 


Asn 


Glu 


Lys 


Lys 


Pro 


He 


Asp 


Glu 


Thr 


Gly 


Gly 


Ser 


Phe 


Asp 


Val 








260 










265 










270 







649 

He 

Glu 

Asn 
410 
Phe 

Leu 

Glu 

Cys 

He 
490 



Met Thr 
380 
Leu Trp 
395 

Ser Met 

Met Glu 

Ser Gly 

Leu Ser 
460 
Tyr He 
475 

Glu Asn 



Arg Asn Arg 

Thr Arg Phe 

Thr Thr Thr 
415 

Met Leu Glu 

430 
Glu Phe Val 
445 

He Thr Asn 

Thr Gin Ser 

Lys Arg Ser 
495 



Thr 

Glu 
400 
Thr 

Glu 

Ala 

Ala 

Asn 
480 
Gly 



ds 
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Pro Asn lie lie Pro Glu Ala Lys His Thr Thr Glu Gly Ser Met Ser 

275 280 285 

Met Val Val Glu Ser Thr Pro Leu Phe Lys Glu Pro Glu Ser lie Met 
290 295 300 

5 Thr Arg Asn Arg Thr Lys Ala Gly Glu Ser Thr Leu Lys Lys Glu Leu 
305 310 ^ 315 320 

Trp Thr Arg Phe Glu Glu Ala Thr He His Asp Ser Arg Phe Asn Ser 

325 330 335 

Met Thr Thr Thr Thr Thr Val Arg Gly Asn Asn Lys Lys Cys Phe Met 
10 340 345 " 350 

Glu Met Leu Glu Glu Val Ser Gly Asn Glu Glu Asp His Glu Leu Ser 

355 360 365 

Gly Glu Phe Val Ala Leu Lys Pro Tyr Gin Val Leu He Ser Glu Leu 
370 375 380 

15 Ser He Thr Asn Ala Phe His Cys Leu Ser Thr Leu He Ser Cys Tyr 
385 390 395 400 

He Thr Gin Ser Asn Lys His Thr His He Thr Tyr Arg Lys He Glu 

405 410 "* 415 

Asn Lys Arc Ser Gly 
20 420 

■25 INFORMATION FOR SEQ ID NO: 130: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 amino acids 

(B) TYPE: amino acid 

2 5 (C) STRANDEDNESS : 

CD) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

3 0 (B) LOCATION: 1 . . 406 

(D) OTHER INFORMATION: / Ceres Seq. ID 2025482 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
Met Ser Asp Phe Gin Leu Leu Pro Glu Asp Gin Glu Glu Gin Ser Leu 
15 10 15 

35 Met Ser Thr Asn Glu Glu Glu Asp Leu Phe Asp Val Glu Ser Pro Glu 

20 " 25 30 

Pro Thr Gin Phe Glu Pro Leu Val Ser Pro Ser Pro Leu Val His Trp 

3 5 4 0 4 5 

Arg Gly A.sp His Asn Ala Asp Lys Gly Arg Gin Leu Phe Leu Leu Thr 

4 0 50~ 55~ 60 

Pro Leu Pro Leu Gly Lys Ser Glu Phe Leu Lys His Gin Asn Ala Ser 
65 70 75 80 

Lys Leu Thr Ala Lys A.rg He Phe Pro Asp Thr Val Ala Asn Glu Pro 
85 90 95 

4 5 Leu Glu Ala Ser Lys Glu Thr Ser Asp Asp Val Leu Gly Gly Glu Ser 

100 105 " 110 

Leu Lys Thr Ala Gly Leu Gly Lys Ser Leu Val His Ala Met Asp Phe 

115 120 125 

Ser Glu Asn Leu Val Glu Tyr Lys Pro Cys Ser Ser Pro Val Leu Arg 
50 130 135 140 

Arg Lys He Leu Ser Glu Leu Leu Met Thr Pro Cys Leu Lys Leu Ser 
145 150 155 ' 160 

Pro Pro Lys Ser Cys Thr Met Phe Lys Pro Val Pro Glu Ser Ser Gin 
165 170 175 

55 Leu Gly Lys Gin Gly Ala Cys Lys Ser Thr Cys Ser Glu Leu Gly Ser 

180 185 190 

Ser Gly He Glu Lys Thr A.sp Asn Leu Cys Ser Lys Tyr Pro Glu Leu 

195 200 205 

Leu Gly He Gin His Ala Pro He Thr Arg Lys Thr A.sp Leu Glu Ser 
60 210 215 220 

Ser Pro Val Trp Trp Phe Ser Pro Pro Lys Thr Cys Val Leu Met Glu 
225 230 235 240 
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Pro Val Asn Glu Lys Lys Pro lie Asp Glu Thr Gly Gly Ser Phe Asp 

24 5 250 255 

Val Pro Asn lie lie Pro Glu Ala Lys His Thr Thr Glu Gly Ser Met 
260 265 270 

5 Ser Met Val Val Glu Ser Thr Pro Leu Phe Lys Glu Pro Glu Ser lie 
275 280 285 

Met Thr Arg Asn Arg Thr Lys Ala Gly Glu Ser Thr Leu Lys Lys Glu 

290 295 300 

Leu Trp Thr Arg Phe Glu Glu Ala Thr lie His Asp Ser Arg Phe Asn 
10 305 310 315 320 

Ser Met Thr Thr Thr Thr Thr Val Arg Gly Asn Asn Lys Lys Cys Phe 

325 330 335 

Met Glu Met Leu Glu Glu Val Ser Gly Asn Glu Glu Asp His Glu Leu 
340 345 350 

15 Ser Gly Glu Phe Val Ala Leu Lys Pro Tyr Gin Val Leu lie Ser Glu 
355 360 365 

Leu Ser lie Thr Asn Ala Phe His Cys Leu Ser Thr Leu lie Ser Cys 

37 0 37 5 38 0 

Tyr lie Thr Gin Ser Asn Lys His Thr His lie Thr Tyr Arg Lys lie 
20 285 390 395 400 

Glu Asn Lys Arg Ser Gly 
4 05 

(2) INFORMATION FOR SEQ ID NO: 131: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 609 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

3 0 (ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 60 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2032963 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
35 tttgtgtggt aattaactqg tgagagagtt caacatccaa tctttctctc tctctctcta 60 
tcttttatct ctcacactct taaaatgctt cttctatctc cgatctctgc ctctcttcca 120 
ccgtcgtttc accggggtaa tttgatccgg cgatcaatta agccgttagg tagggttgtt 180 
gcaaaggcca aggacaacac cgacactggt ggatttctgg agacggcagc tatagccggt 240 
agtttagtgt cgacgccggt tattggatgg tcactctaca ctttgaagac cacgggatgt 300 

4 0 ggcttacctc ccggaccggc cggtttaatc ggagcgttgg aaggtgtgag ctacttggtg 360 

gtggttggca tcgtgggctg gtctttgtac actaaaacaa aaactgggtc aggtctgcca 420 
aatgggccat ttggtttgtt gggtgcggtc gagggtttat cgtatctgtc ggttctagcc 480 
attcttgtgg tgtttggtat tcagttcttg gataatgggt cggttccagg tccacttcct 540 
agtgaccagt gttttggtta accacgtgta atgtcgtata aataaagtgt caaattaata 600 
45 attgctttc 

(2) INFORMATION FOR SEQ ID NO: 132: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 
50 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 
55 (B) LOCATION: 1..158 

(D) OTHER INFORMATION: / Ceres Seq. ID 2032964 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
Met Leu Leu Leu Ser Pro lie Ser Ala Ser Leu Pro Pro Ser Phe His 
15 10 15 

60 Arg Gly Asn Leu lie Arg Arg Ser lie Lys Pro Leu Gly Arg Val Val 

2 0 25 " 30 

Ala Lys Ala Lys Asp Asn Thr Asp Thr Gly Gly Phe Leu Glu Thr Ala 
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Ala 


lie 


Ala 


Gly 


Ser 


Leu 


Val 




50 










55 


Tyr 


Thr 


Leu 


Lys 


Thr 


Thr 


Gly 


65 










70 




Leu 


He 


Gly 


Ala 


Leu 


Glu 


Gly 










85 






Val 


Gly 


Trp 


Ser 


Leu 


Tyr 


Thr 








100 








Asn 


Gly 


Pro 


Phe 


Gly 


Leu 


Leu 






115 










Ser 


Val 


Leu 


Ala 


He 


Leu 


Val 




130 










135 


Gly 


Ser 


Val 


Pro 


Gly 


Pro 


Leu 


145 










150 




(2) 


INFORMATION 


FOR 


SEQ 


ID 1 
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35 40 45 

Ser Thr Pro Val He Gly Trp Ser Leu 
60 

Cys Gly Leu Pro Pro Gly Pro Ala Gly 

75 8 0 

Val Ser Tyr Leu Val Val Val Gly He 

90 95 
Lys Thr Lys Thr Gly Ser Gly Leu Pro 

105 110 
Gly Ala Val Glu Gly Leu Ser Tyr Leu 
120 125 
Val Phe Gly He Gin Phe Leu Asp Asn 
140 

Pro Ser Asp Gin Cys Phe Gly 
15 145 150 .. 155 

NO: 133 : 

; i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 
20 fC) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iij MOLECULE TYPE: DMA (genomic; 
(ix; FEATURE: 

(A) NAME /KEY: - 
25 (B) LOCATION: 1..330 

( D) OTHER INFORMATION: / Ceres Seq. ID 2033706 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
atgggccaca gagtgacgag agagagagtg aaaatggaga agtactttgg aaargcgtac 60 
aggggagatc caggagtgcc acatgcagat gcagatcgtt tcgtgaatat arggattggt 120 
30 tctgctgctt tctccgttct tacctgggtr aatccttaca tgtggcagct ctctaaccag 180 
ttcaattacc atgacaagtg gatgctgttt gagcagtacc actggaaaaa agcaagggca 240 
aagaagcaac cttatgaatt caagtggaat aagataccca aagaagtcag ggactcgtac 300 
tattacaact ggccagtcta cttcccatag 
(2) INFORMATION FOR SEQ ID NO: 134: 
35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
4 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : peptide 

(B) LOCATION: 1 . .10 9 

(D) OTHER INFORMATION: / Ceres Seq. ID 2033707 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Met Gly His Arg Val Thr Arg Glu Arg Val Lys Met Glu Lys Tyr Phe 
15 10 15 

Gly Asn Ala Tyr Arg Gly Asp Pro Gly Val Pro His Ala Asp Ala Asp 
20 25 30 

50 Arg Phe Val Asn He Trp He Gly Ser Ala Ala Phe Ser Val Leu Thr 
35 40 45 

Trp Val Asn Pro Tyr Met Trp Gin Leu Ser Asn Gin Phe Asn Tyr His 

50 55 60 

Asp Lys Trp Met Leu Phe Glu Gin Tyr His Trp Lys Lys Ala Arg Ala 
55 65 7 0 7 5 8 0 

Lys Lys Gin Pro Tyr Glu Phe Lys Trp Asn Lys He Pro Lys Glu Val 

85 90 95 

Arg Asp Ser Tyr Tyr Tyr Asn Trp Pro Val Tyr Phe Pro 
100 105 
60 (2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME/KEY : peptide 

(B) LOCATION: 1 . . 98 

(D) OTHER INFORMATION: / Ceres Seq. ID 2033708 





(xi) 


SEQUENCE DESCRIPTION: , 


SEQ 


ID NO: 135 : 










Met 


Glu 


Lys 


Tyr 


Phe 


Gly 


Asn 


Ala 


Tyr 


Arg 


Gly 


Asp 


Pro 


Gly 


Val 


Pro 


1 








5 










10 










15 




His 


Ala 


Asp 


Ala 


Asp 


Arg 


Phe 


Val 


Asn 


He 


Trp 


He 


Gly 


Ser 


Ala 


Ala 








20 










25 










30 






Phe 


Ser 


Val 


Leu 


Thr 


Trp 


Val 


Asn 


Pro 


Tyr 


Met 


Trp 


Gin 


Leu 


Ser 


Asn 






35 










40 










45 








Gin 


Phe 


Asn 


Tyr 


His 


Asp 


Lys 


Trp 


Met 


Leu 


Phe 


Glu 


Gin 


Tyr 


His 


Trp 




50 










55 










60 










Lys 


Lys 


Ala 


Arg 


Ala 


Lys 


Lys 


Gin 


Pro 


Tyr 


Glu 


Phe 


Lys 


Trp 


Asn 


Lys 


65 










70 










75 










80 


He 


Pro 


Lys 


Glu 


Val 


Arg 


Asp 


Ser 


Tyr 


Tyr 


T vr 


Asn 


Trp 


Pro 


Val 


Tyr 



35 90 95 



Phe Pro 



(2) INFORMATION FOR SEQ ID NO : 1 3 6 : 
2 5 (i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
30 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 56 

(D) OTHER INFORMATION: / Ceres Seq. ID 2033709 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Met Trp Gin Leu Ser Asn Gin Phe Asn Tyr His Asp Lys Trp Met Leu 
15 10 15 

Phe Glu Gin Tyr His Trp Lys Lys Ala Arg Ala Lys Lys Gin Pro Tyr 
20 25 30 

4 0 Glu Phe Lys Trp Asn Lys He Pro Lys Glu Val Arg Asp Ser Tyr Tyr 
35 40 45 

Tyr Asn Trp Pro Val Tyr Phe Pro 

50 55 
(2) INFORMATION FOR SEQ ID NO: 137: 
4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

50 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .418 

(D) OTHER INFORMATION: / Ceres Seq. ID 2043118 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

ataacaatat caaaagataa aaacagagtt tgcttttata agatcaaaga aggtaaaaaa 60 
agaaatggca aaatggagtg cgatcgtatt gataatgatg gtgatgattg ttgcggtcac 120 
aatagaagca caagaagaaa gtggtgggac tatttgtttt cgccaatgtt ctcagccttg 180 
tcgcagtgac gacggtagtt gctatgaaaa ctgtaaaatc gaatgcggtg gtcccaagcc 240 

60 tcccttatct cgtctacgaa gctcacatgc atgagatggc atcatcaatg gaagttcgcg 300 
gaaggatagg atgataaaaa gagatttgta gcggattttt gatcatctct actgttttta 360 
acttcacgtt ttatattata agagtttcat taaaaagatc aataaaacga gaaaatgt 
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(2) INFORMATION FOR SEQ ID NO: 138: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 
10 (B) LOCATION: 1..69 

(D) OTHER INFORMATION: / Ceres Seq. ID 2043119 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
Met Ala Lys Trp Ser Ala lie Val Leu lie Met Met Val Met lie Val 
15 10 15 

15 Ala Val Thr lie Glu Ala Gin Glu Glu Ser Gly Gly Thr lie Cys Phe 

20 25 30 

Arg Gin Cys Ser Gin Pro Cys Arg Ser Asp Asp Gly Ser Cys Tyr Glu 

35 4 0 * 4 5 

Asn Cys Lys lie Glu Cys Giy Gly Pro Lys Pro Pro Leu Ser Arg Leu 
20 SO 55" 60 

Arg Ser Ser His Ala 
65 

.2} INFORMATION FOR SEQ ID NO: 139: 
(i) SEQUENCE CHARACTERISTICS: 
2 5 (A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

30 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 62 

(D) OTHER INFORMATION: / Ceres Seq. ID 2043120 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
35 lie Thr lie Ser Lys Asp Lys Asn Arg Val Cys Phe Tyr Lys lie Lys 
1 5^ 10 15 

Glu Gly Lys Lys Arg Asn Gly Lys Met Glu Cys Asp Arg lie Asp Asn 

20 25 " 30 

Asp Giy Asp Asp Cys Cys Gly His Asn Arg Ser Thr Arg Arg Lys Trp 
4 0 " 35 4 0 45"^ 

Trp Asp Tyr Leu Phe Ser Pro Met Phe Ser Ala Leu Ser Gin 

50 55 60 

(2} INFORMATION FOR SEQ ID NO: 140: 
(i) SEQUENCE CHARACTERISTICS: 
4 5 (A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

50 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 59 

(D) OTHER INFORMATION: / Ceres Seq. ID 2043121 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140: 
55 Met Met Val Met He Val Ala Val Thr He Glu Ala Gin Glu Glu Ser 
15 10 15 

Gly Gly Thr He Cys Phe Arg Gin Cys Ser Gin Pro Cys Arg Ser Asp 

20 25 30 

Asp Gly Ser Cys Tyr Glu Asn Cys Lys He Glu Cys Gly Gly Pro Lys 
60 35 40 45 

Pro Pro Leu Ser Arg Leu Arg Ser Ser His Ala 
50 " 55 
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(2) INFORMATION FOR SEQ ID NO: 141: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: - 
10 (B) LOCATION: 1..838 

(D) OTHER INFORMATION: / Ceres Seq. ID 2047214 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
acacgaaacg attccgcttc tccgcttcct tcataaatat ctcactcatc gtctctagat 60 
tgttctttca cacgcaattt tcatttctct cttaaccgta aacgaatctc tcttctcaaa 120 
15 gttttatttt ctctctgcga tggctcaaga agatgttact gctgttgcta ctaacggtgc 180 
tggtccggtg gagacgcatc ttgtcttcac tgagttcaag cagatgttgc tcgttgaagc 240 
tcaaaaggtc ggtgacgctg ttactttcta caaatctgct tttggtgcga tcgagtctgg 300 
tcattctctt taccctaagc graagcttga ccaagagctt cctcatgttc tctcttctga 360 
gcttaatctc gctggctctt ccttcgttgt ttgcgacgtt tcctctctcc ctggtttttc 420 
20 tactgcgaaa tcggaaggtt cgggagtgac ttttcttctc ggaacraagg atgcrgaagc 480 
cgccgttgcq aaagctgttg acgccggagc tqtgaaagtg gaggttacgg aggcagaagt 540 
tgaaccggga ttcaaaggaa aagntacgga tccttttggt gtcacttgga tcttcgcgga 600 
gaagaagacc gtgatcaccg acgagaacaa agaggtttag aatctgtcgt cggatctaat 660 
tcttcgtcgg tttctgaaca aaaaaaacta tctattatcc taggtttatg cttttatttt 720 
25 gtcttttgtg gaatccggta atcagtaaac cggatcgagt aggctaatga ctttcggatt 780 
ctaatttcac cccttttgac aaactctaat cttggtgaaa rtgcatatta atctcggg 
(2) INFORMATION FOR SEQ ID NO : 1 4 2 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
fix) FEATURE: 

3 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 166 

(D) OTHER INFORMATION: / Ceres Seq. ID 2047215 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 2 : 
Met Ala Gin Glu Asp Val Thr Ala Val Ala Thr Asn Gly Ala Gly Pro 
40 1 5 10 15 

Val Glu Thr His Leu Val Phe Thr Glu Phe Lys Gin Met Leu Leu Val 
20 25 ' 30 

il Thr Phe Tyr Lys Ser Ala Phe 
45 

4 5 Gly Ala lie Glu Ser Gly His Ser Leu Tyr Pro Lys Arg Lys Leu Asp 

60 

Ser Glu Leu Asn Leu Ala Gly Ser 

75 80 
Ser Leu Pro Gly Phe Ser Thr Ala 
50 85 90 ' 95 

Phe Leu Leu Gly Thr Lys Asp Ala 
105 110 
Asp Ala Gly Ala Val Lys Val Glu 
125 

Gly Phe Lys Gly Lys Val Thr Asp 
130 135 140 

Pro Phe Gly Val Thr Trp lie Phe Ala Glu Lys Lys Thr Val lie Thr 
145 ~ 150 155 ' 160 

Asp Glu Asn Lys Glu Val 
60 165 

(2) INFORMATION FOR SEQ ID NO : 1 4 3 : 
(i) SEQUENCE CHARACTERISTICS: 



55 



Glu 


Ala 


Gin 


Lys 


Val 


Gly 


Asp 


Ala 






35 










40 


Gly 


Ala 


He 


Glu 


Ser 


Gly 


His 


Ser 




50 










55 




Gin 


Glu 


Leu 


Pro 


His 


Val 


Leu 


Ser 


65 










70 






Ser 


Phe 


Val 


Val 


Cys 


Asp 


Val 


Ser 










85 








Lys 


Ser 


Glu 


Gly 


Ser 


Gly 


Val 


Thr 








100 










Glu 


Ala 


Ala 


Val 


Ala 


Lys 


Ala 


Val 






115 










120 


Val 


Thr 


Glu 


Ala 


Glu 


Val 


Glu 


Leu 
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(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . .138 

(D) OTHER INFORMATION: / Ceres Seq. ID 2047216 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 



Met 


Leu 


Leu 


Val 


Glu 


Ala 


Gin 


Lys 


Val 


Gly 


Asp 


Ala 


Val 


Thr 


Phe 


Tyr 


1 








5 










10 










15 




Lys 


Ser 


Ala 


Phe 


Gly 


Ala 


He 


Glu 


Ser 


Gly 


His 


Ser 


Leu 


Tyr 


Pro 


Lys 








20 










25 










30 






Arg 


Lys 


Leu 


Asp 


Gin 


Glu 


Leu 


Pro 


His 


Val 


Leu 


Ser 


Ser 


Glu 


Leu 


Asn 






35 










40 










45 








Leu 


Ala 


Gly 


Ser 


Ser 


Phe 


Val 


Val 


Cys 


Asp 


Val 


Ser 


Ser 


Leu 


Pro 


Gly 




50 










55 










60 










Phe 


Ser 


Thr 


Ala 


Lys 


Ser 


Glu 


Gly 


Ser 


Gly 


Val 


Thr 


Phe 


Leu 


Leu 


Gly 


65 










70 










^5 










8 0 


Thr 


Lys 


Asp 


Ala 


Glu 


Ala 


AI a 


Val 


Ala 


Lys 


Ala 


- - _ i 
■j a j. 


Asp 


Ala 


Gly 


- i _ 










6 5 










90 










9 5 




Val 


Lys 


Val 


Glu 


Val 


Thr 


Glu 


Ala 


Glu 


Val 


Glu 


Leu 


Gly 


Phe 


Lys 


Gly 








100 










105 










110 






Lys 


Val 


Thr 


Asp 


Pro 


Phe 


Gly 


Val 


Thr 


Trp 


He 


Phe 


Ala 


Glu 


Lys 


Lys 






115 










120 










125 








Thr 


Val 


He 


Thr 


Asp 


Glu 


Asn 


Lys 


Glu 


Val 















130 135 
(2) INFORMATION FOR SEQ ID NO : 1 4 4 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
fix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 98 

(D) OTHER INFORMATION : / Ceres Seq. ID 2047217 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 4 : 
Met Leu Leu Leu Leu Leu Leu Thr Val Leu Val Arg Trp Arg Arg He 
1 5 10 15 

Leu Ser Ser Leu Ser Ser Ser Arg Cys Cys Ser Leu Lys Leu Lys Arg 

20 2 5 ^ 30 

Ser Val Thr Leu Leu Leu Ser Thr Asn Leu Leu Leu Val Arg Ser Ser 

35 40 45 

Leu Val He Leu Phe Thr Leu Ser Val Ser Leu Thr Lys Ser Phe Leu 

50 55 60 

Met Phe Ser Leu Leu Ser Leu He Ser Leu Ala Leu Pro Ser Leu Phe 
65 70 75 80 

Ala Thr Phe Pro Leu Ser Leu Val Phe Leu Leu Arg Asn Arg Lys Val 
8 5 90 95 

Arg Glu 



(2) INFORMATION FOR SEQ ID NO: 145: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(ix) FEATURE: 
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(A) NAME /KEY : - 

(B) LOCATION: 1..682 

(D) OTHER INFORMATION: / Ceres Seq. ID 2047438 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
5 atgattggtg gtttcttcgc cgatgtaaaa ttcaacagat cgagagagaa aaaactctct 60 
ctccctcctc cgttcaccgt cgcgactcgg ctctacagta cattttgttg agcaatctcg 120 
tatttagagg attccaatgt ctggcgcaga ggataataaa tcttcccatg ctcagctttc 180 
gtcccaaatt tttcttgatc ttgtggattc ggtgatagct gatgtagcat ctgagtgtca 240 
tcgggtagca cgactaggac ttgatcgtga tttggatata gtagaagaag agttgagatt 300 
10 gtcagtggaa gcgcgtgcga agatcgctga tcctagcaac aacctcgaaa ccaacaccaa 360 
atatgttgtt gatatatttg ggcagactca ccctcctgta gctagtgaag tgttcaattg 420 
catgaactgt gggcgacaaa ttgttgcagg aaggtttgct cctcatctcg aaaaatgcat 480 
gggaaaggga agaaaggctc gtgccaagac aaccagaagc acaacggctg cacagaaccg 540 
gaatgcacga cgcagcccca atccacgata ttctccttat ccaaattctg ctagtgagaa 600 
15 ccagttagca agtggatcac ctggtgttgc aggtgaagac tgctcaaatt tcacagttcg 660 
agagaacgtg aaaggagact ga 
(2) INFORMATION FOR SEQ ID NO : 1 4 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 
20 (B) TYPE: amino acid 

;C} STRANDEDNESS: 

(D) TOPOLOGY: linear 
Uij MOLECULE TYPE: peptide 
■ix) FEATURE: 
2 5 (A) NAME/KEY: peptide 

(B) LOCATION: 1 . .181 

(D) OTHER INFORMATION: / Ceres Seq. ID 2047439 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 6 : 

Met 

30 



35 



Met 


Ser 


Gly 


Ala 


Glu 


Asp 


Asn 


Lys 


Ser 


Ser 


His 


Ala 


Gin 


Leu 


Ser 


Ser 


1 








5 










10 










15 




Gin 


He 


Phe 


Leu 


Asp 


Leu 


Val 


Asp 


Ser 


Val 


He 


Ala 


Asp 


Val 


Ala 


Ser 








20 










25 










30 






Glu 


Cys 


His 


Arg 


Val 


Ala 


Arg 


Leu 


Gly 


Leu 


Asp 


Arg 


Asp 


Leu 


Asp 


He 






35 










40 










45 








Val 


Glu 


Glu 


Glu 


Leu 


Arg 


Leu 


Ser 


Val 


Glu 


Ala 


Arg 


Ala 


Lys 


He 


Ala 




50 










55 










60 










Asp 


Pro 


Ser 


Asn 


Asn 


Leu 


Glu 


Thr 


Asn 


Thr 


Lys 


Tyr 


Val 


Val 


Asp 


He 


65 










70 










75 










80 


Phe 


Gly 


Gin 


Thr 


His 


Pro 


Pro 


Val 


Ala 


Ser 


Glu 


Val 


Phe 


Asn 


Cys 


Met 










85 










90 










95 




Asn 


Cys 


Gly 


Arg 


Gin 


He 


Val 


Ala 


Gly 


Arg 


Phe 


Ala 


Pro 


His 


Leu 


Glu 








100 










105 










110 






Lys 


Cys 


Met 


Gly 


Lys 


Gly 


Arg 


Lys 


Ala 


Arg 


Ala 


Lys 


Thr 


Thr 


Arg 


Ser 






115 










120 










125 








Thr 


Thr 


Ala 


Ala 


Gin 


Asn 


Arg 


Asn 


Ala 


Arg 


Arg 


Ser 


Pro 


Asn 


Pro 


Arg 




130 










135 










140 










Tyr 


Ser 


Pro 


Tyr 


Pro 


Asn 


Ser 


Ala 


Ser 


Glu 


Asn 


Gin 


Leu 


Ala 


Ser 


Gly 


145 










150 










155 










160 


Ser 


Pro 


Gly 


Val 


Ala 


Gly 


Glu 


Asp 


Cys 


Ser 


Asn 


Phe 


Thr 


Val 


Arg 


Glu 










165 










170 










175 




Asn 


Val 


Lys 


Gly 


Asp 

























40 



45 



50 

Asn Val Lys 

18 0 

(2) INFORMATION FOR SEQ ID NO: 147: 
(i> SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
60 (ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..501 
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(D) OTHER INFORMATION: / Ceres Seq. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
aaacaaaggt gtcaagaaga aggatcaaca tattaattga ctaaaaatga aggcatggat 
gataatcttg ttggtgattt gtgtcgctgt ggtggtggag caatcagagg ctcgcaaagg 
tcgaaagtat ttaaatccag gcgtgcttga^ccggtgt cgt ggtcctaatc ctccagcggg 
atgtcatcct cacaattccc accacaaacc tcgcgtccct gttcacaatt atagtcgtgg 
ttgtagtaga attacccggt gcagacgaga tgcctaggtt taacctcaca tgacgacgac 
gtccctttat attgatcctt cttctacatg agacttgcaa tgatattata tattcatata 
tatacacatt ecgctgtata tgataattcc agttgattaa tatataaatc ttactgattt 
caattcaata cacaagatcc tgtaactgaa ataattttgt ccatgttttg tgcttataat 
aatggtatga ctcttattat t 
(2) INFORMATION FOR SEQ ID NO: 14 8: 
;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 
.ii) MOLECULE TYPE: peptide 
FEATURE: 
(A) NAME /KEY : peptide 
• B ; LOCATION: I . . 7 6 

(D) OTHER INFORMATION: / Ceres Seq. I 
SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Ala Trp Met lie lie Leu Leu Val lie Cys 
5 

Glu Gin Ser Glu Ala Arg Lys Gly 

20 25 
Leu Asp Arg Cys Arg Gly Pro Asn 



1 
1 
2 
3 
3 
4 
4 



Met 

Val 

Val 

His 

Gly 

65 

(2) 



ix ; 



i xi ■ 



2049057 



val Ala 



10 

Arg Lys Tyr Leu 



Pro Pro Ala 



35 

Asn Ser 

50 

Cys Ser 



His His Lys 



Pro 
55 



40 
Arg 



Val Pro Val 



His 
60 



Gly 
45 
Asn 



Asn 

30 
Cys 



Val 

15 

Pro 



Val 



Sly 



His Pro 



Tyr Ser Arg 



Ar-r Cys Arg Ax~g Asp Ala 
7 5 

ID NO : 1 4 9 : 



Arg lie Thr 
7 0 

INFORMATION FOR SEQ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: 
FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



iii) 
i ix) 



peptide 



Met 
1 

Glu 

Cys 

His 

He 
65 

{2} 



peptide 
1 . . 72 

(D) OTHER INFORMATION: / Ceres Seq. ID 2049058 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

He He Leu Leu Val He Cys Val Ala Val Val Val Glu Gin Ser 

5 10 15 

Ala Arg Lys Gly Arg Lys Tyr Leu Asn Pro Gly Val Leu Asp Arg 



20 

Gly Pro 
35 

Lys Pro Arg 
50 

Thr Arg Cys 



25 
Gly 



30 
Asn 



Ser His 



Asn Pro Pro Ala Gly Cys His Pro His 

40 45 
Val Pro Val His Asn Tyr Ser Arg Gly Cys Ser Arg 

55 60 
Arg Arg Asp Ala 
7 0^ 

FOR SEQ ID NO: 150: 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 



INFORMATION 

(i) 



(ii) 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 1007 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050386 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



ccaaatttct ccatggatcg ttctttgaat ctcctcgatt tagccttagg gttcgatgag 60 

cagctagcta ttccatcgcc actaaatgga aaagtaatac taatagaaga ctgtgtagag 120 

acgagtggtt cctttgtact tcaccagcta atgaaacgtg ttctctcctc taactcctcc 180 

gacgcactta tctttctcgc ttttgctcgc cctttctctc attatgatcg aatcttgcgt 240 

10 aaactgggat gtaatttagc tacccataag tcgaataatc gattggtgtt ctttgacatg 300 

ctcatggtta agtgttcaga tggggatcaa atggaagaca atgtgagtgc agttgcgaaa 360 

ctatttcggg agatacaaga aaccgttcga aagctacaga gtgtaacaag tggtaacata 420 

actgttatgg tggatgacat gtctctgctg gaaattgcta ctaccggcag caactcagat 4 80 

cacgtattgg acttcttgca ttattgccac acattaagtt ctgaaagcaa ttgttcattg 540 

15 gtcatcctca atcatgaaga tatatacgcg agcatggaga gacctgcatt tttgctacag 600 

atggtatgcc ttgcagatgt tgtgataaag gcagagcctt tagcctctgg tttagcaaat 660 

gatgtacatg gccaattgac tgttctgaac aaagggataa gcaactcagg tagaggaagc 720 

rcgaggaaca agttgcagaa ttttcaattc aggatcaaag aaaatggtat cgactatttc 780 

tatcctggtt gcagaagctg aggattagac accgttctga gcattagcat ggtagtgaga 840 

20 tgtttttcta gacagttttg ttatttgcgt ttaggttaaa cagagacact ctgttaaaca 900 

tttttacttt agtgtctctt ctggttgtaa cttgatgact tagctaaagt tgaatgcctg 960 



agacacagaa acttgtagac atggaaaact agtaacattc tatatct 
[2] INFORMATION FOR SEQ ID NO : I 5 1 : 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
30 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..266 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050387 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 1 : 



35 


Pro 
1 


Asn 


Phe 


Ser 


Met 
5 


Asp 


Arg 


Ser 


Leu 


Asn 
10 


Leu 


Leu 


Asp 


Leu 


Ala 
15 


Leu 




Gly 


Phe 


Asp 


Glu 

20 


Gin 


Leu 


Ala 


He 


Pro 
25 


Ser 


Pro 


Leu 


Asn 


Gly 
30 


Lys 


Val 




He 


Leu 


He 


Glu 


Asp 


Cys 


Val 


Glu 


Thr 


Ser 


Gly 


Ser 


Phe 


Val 


Leu 


His 


40 






35 










40 










45 










Gin 


Leu 
50 


Met 


Lys 


Arg 


Val 


Leu 
55 


Ser 


Ser 


Asn 


Ser 


Ser 
60 


Asp 


Ala 


Leu 


He 




Phe 


Leu 


Ala 


Phe 


Ala 


Arg 


Pro 


Phe 


Ser 


His 


Tyr 


Asp 


Arg 


He 


Leu 


Arg 




65 










70 










75 










80 


45 


Lys 


Leu 


Gly 


Cys 


Asn 
85 


Leu 


Ala 


Thr 


His 


Lys 
90 


Ser 


Asn 


Asn 


Arg 


Leu 
95 


Val 




Phe 


Phe 


Asp 


Met 
100 


Leu 


Met 


Val 


Lys 


Cys 
105 


Ser 


Asp 


Gly 


Asp 


Gin 
110 


Met 


Glu 




Asp 


Asn 


Val 


Ser 


Ala 


Val 


Ala 


Lys 


Leu 


Phe 


Arg 


Glu 


He 


Gin 


Glu 


Thr 


50 






115 










120 










125 










Val 


Arg 
130 


Lys 


Leu 


Gin 


Ser 


Val 
135 


Thr 


Ser 


Gly 


Asn 


He 
140 


Thr 


Val 


Met 


Val 




Asp 


Asp 


Met 


Ser 


Leu 


Leu 


Glu 


He 


Ala 


Thr 


Thr 


Gly 


Ser 


Asn 


Ser 


Asp 




145 










150 










155 










160 


55 


His 


Val 


Leu 


Asp 


Phe 
165 


Leu 


His 


Tyr 


Cys 


His 
170 


Thr 


Leu 


Ser 


Ser 


Glu 
175 


Ser 




Asn 


Cys 


Ser 


Leu 
180 


Val 


He 


Leu 


Asn 


His 
185 


Glu 


Asp 


He 


Tyr 


Ala 
190 


Ser 


Met 




Glu 


Arg 


Pro 


Ala 


Phe 


Leu 


Leu 


Gin 


Met 


Val 


Cys 


Leu 


Ala 


Asp 


Val 


Val 


60 






195 










200 










205 










He 


Lys 


Ala 


Glu 


Pro 


Leu 


Ala 


Ser 


Gly 


Leu 


Ala 


Asn 


Asp 


Val 


His 


Gly 



210 215 220 
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Gin Leu Thr Val Leu Asn Lys Gly He Ser Asn Ser Gly Arg Gly Ser 
225 230 235 240 

Ser Arg Asn Lys Leu Gin Asn Phe Gin Phe Arg He Lys Glu Asn Gly 
245 250 255 

5 He Asp Tyr Phe Tyr Pro Gly Cys Arg Ser 
260 265 
(2) INFORMATION FOR SEQ ID NO: 152: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

15 (A) NAME/KEY: peptide 

(B) LOCATION: 1..2 62 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050388 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 





i v je u 


As p 


Ii v n 




Leu 


As n 


Leu 


Leu 


As p 


Leu 


£ "1 n 


Leu 


Kj± y 


ne 


Asp 


r 1 n 
Kj± u 




i 

i 


















i 'j 










1 H 

1 D 






'^j j- n 




jr\± a 




Pro 


Coy- 

is e r 


hr r O 


Leu 


as n 


Cj j_y 




V ct J_ 


j. ±e 


Leu 


i ± e 


C 1 i - 
biU 






































Asp 




vdl 


<vj _LU 


i nr 


Q o V s 

o e r 


j_ y 


Ser 


ir ne 


V d± 


Leu 


nlS 


j. n 


Leu 


rJcL 


Lys 








-j o 










■4 U 










4 3 








o R 


Ar g 


"7-1 


Leu 


C ^ "V~ 


Ser 


Asn 


C /-s -tr- 

be r 


ber 


Asp 


Ala 


Leu 


ire 


O V-v ^ 

r ne 


Leu 


a± a 


ir ne 






c , 0 

-J vJ 




















D U 












Ala 


Arg 


Pro 


Phe 


Ser 


His 


Tyr 


Asp 


Arg 


He 


Leu 


Arg 


Lys 


Leu 


Gly 


Cys 




65 










70 










75 










80 




Asn 


Leu 


Ala 


Thr 


His 


Lys 


Ser 


Asn 


Asn 


Arg 


Leu 


Val 


Phe 


Phe 


Asp 


Met 


30 










85 










90" 










95 






Leu 


Met 


Val 


Lys 


Cys 


Ser 


Asp 


Gly 


Asp 


Gin 


Met 


Glu 


Asp 


Asn 


Val 


Ser 










100 










105 










110 








Ala 


Val 


Ala 


Lys 


Leu 


Phe 


Arg 


Glu 


He 


Gin 


Glu 


Thr 


Val 


Arg 


Lys 


Leu 








115 










120 










125 








35 


Gin 


Ser 


Val 


Thr 


Ser 


Gly 


Asn 


He 


Thr 


Val 


Met 


Val 


Asp 


Asp 


Met 


Ser 






130 










135 










140 












Leu 


Leu 


Glu 


lie 


Ala 


Thr 


Thr 


Gly 


Ser 


Asn 


Ser 


Asp 


His 


Val 


Leu 


Asp 




145 










150 










155 










160 




Phe 


Leu 


His 


Tyr 


Cys 


His 


Thr 


Leu 


Ser 


Ser 


Glu 


Ser 


Asn 


Cys 


Ser 


Leu 


40 










165 










170 










175 






Val 


He 


Leu 


Asn 


His 


Glu 


Asp 


lie 


Tyr 


Ala 


Ser 


Met 


Glu 


Arg 


Pro 


Ala 










180 










185 










190 








Phe 


Leu 


Leu 


Gin 


Met 


Val 


Cys 


Leu 


Ala 


Asp 


Val 


Val 


He 


Lys 


Ala 


Glu 








195 










200 










205 








45 


Pro 


Leu 


Ala 


Ser 


Gly 


Leu 


Ala 


Asn 


Asp 


Val 


His 


Gly 


Gin 


Leu 


Thr 


Val 






210 










215 










220 












Leu 


Asn 


Lys 


Gly 


He 


Ser 


Asn 


Ser 


Gly 


Arg 


Gly 


Ser 


Ser 


Arg 


Asn 


Lys 




225 










230 










235 










240 




Leu 


Gin 


Asn 


Phe 


Gin 


Phe 


Arg 


He 


Lys 


Glu 


Asn 


Gly 


He 


Asp 


Tyr 


Phe 


50 










245 










250 










255 






Tyr 


Pro 


Gly 


Cys 


Arg 


Ser 























260 



(2) INFORMATION FOR SEQ ID NO: 153: 
(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

60 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1. .216 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



661 

(D) OTHER INFORMATION: / Ceres Seq. ID 2050389 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
Met Lys Arg Val Leu Ser Ser Asn Ser Ser Asp Ala Leu lie Phe Leu 
1 5 10 15 

Ala Phe Ala Arg Pro Phe Ser His Tyr Asp Arg lie Leu Arg Lys Leu 

20 25 30 

Gly Cys Asn Leu Ala Thr His Lys Ser Asn Asn Arg Leu Val Phe Phe 

35 40 45 

Asp Met Leu Met Val Lys Cys Ser Asp Gly Asp Gin Met Glu Asp Asn 

50 5 5 ~ " " 60 

Val Ser Ala Val Ala Lys Leu Phe Arg Glu lie Gin Glu Thr Val Arg 
65 70 75 80 

Lys Leu Gin Ser Val Thr Ser Gly Asn lie Thr Val Met Val Asp Asp 

85 90 95 

Met Ser Leu Leu Glu lie Ala Thr Thr Gly Ser Asn Ser Asp His Val 

100 105 lib 

Leu Asp Phe Leu His Tyr Cys His Thr Leu Ser Ser Glu Ser Asn Cys 

115 120 125 

Ser Leu Val lie Leu Asn His Glu Asp lie Tyr Ala Ser Met Glu Arg 

130 135 " ' 140 

Pro Ala Phe Leu Leu Gin Met Val Cys Leu Ala Asp Val Val lie Lys 
145 150 155 160 

Ala Glu Pro Leu Ala Ser Gly Leu Ala Asn Asp Val His Gly Gin Leu 

165 17 0 17 5 

Thr Val Leu Asn Lys Gly lie Ser Asn Ser Gly Arg Gly Ser Ser Arg 

180 185 190 

Asn Lys Leu Gin Asn Phe Gin Phe Arg lie Lys Glu Asn Gly lie Asp 

195 200 205 

Tyr Phe Tyr Pro Gly Cys Arg Ser 

210 215 
(2} INFORMATION FOR SEQ ID NO: 154: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
DNA (genomic) 



(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



1. . 634 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053353 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 
gaataaaaat ctatttcttg tgtttgtgtt gtatttactt aaaaaaataa aggagatcag 
tttgggagaa agcagcaaag aagaaaaatg gcagcaacat ttgcaacacc atcgacggtg 
ataggcctcg gaggatcatc catcaccacc aaacccttct cttcatcctt tttaaaacca 
acattaagcg ccaagaaccc tttgagactc gccggtgcat cgggaggaag agtcacttgc 
tttgagagga actggttgag gagagatttg aacgtggtag gatttgggct gatcggatgg 
ctagctccgt cgagcattcc agcgataaat gggaagagcc tgacgggtct cttcttcgat 
agcatcggaa ctgagctcgc tcacttcccg actcctccag ctctcacttc acagttctgg 
ttgtggttgg ttacgtggca cttaggcctc ttcctctgcc tcactttcgg acaaatcgga 
ttcaagggca ggactgaaga ttacttctaa ggataactat tcttgttttc gtttgtacta 
tatgctctct cttggttatg tgtaatatta tcaatcaaaa caaagctttc tttgctattt 
gatgtttcat tctatatctc gacaagtctt tctt 
(2) INFORMATION FOR SEQ ID NO: 155: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



WO 00/40695 



PCT/US00/00466 



662 

(B) LOCATION: 1 . . 14 0 ' 

(D) OTHER INFORMATION: / Ceres Seq. ID 2053354 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l55: 
Met Ala Ala Thr Phe Ala Thr Pro Ser Thr Val He Gly Leu Gly Gly 
5 1 5 10 15 

Ser Ser He Thr Thr Lys Pro Phe Ser Ser Ser Phe Leu Lys Pro Thr 

20 25 30 

Leu Ser Ala Lys Asn Pro Leu Arg Leu Ala Gly Ala Ser Gly Gly Arg 
35 40 45 

10 Val Thr Cys Phe Glu Arg Asn Trp Leu Arg Arg Asp Leu Asn Val Val 
50 55 60 

Gly Phe Gly Leu He Gly Trp Leu Ala Pro Ser Ser He Pro Ala He 
65 70 75 80 

Asn Gly Lys Ser Leu Thr Gly Leu Phe Phe Asp Ser He Gly Thr Glu 
15 85 90 95 

Leu Ala His Phe Pro Thr Pro Pro Ala Leu Thr Ser Gin Phe Trp Leu 

100 105 110 

Trp Leu Val Thr Trp His Leu Gly Leu Phe Leu Cys Leu Thr Phe Gly 
115 120 " 125 . 

2 0 Gin He Gly Phe Lys Gly Arq Thr Glu Asp Tyr Phe 
130 135 140 

{ 2 ) I N FORMAT ION FOR SEQ ID MO : I 5 6 : 
;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: DNA (genomic) 
!ix) FEATURE: 
30 (A) NAME/KEY: - 

(B) LOCATION: 1..393 

(D) OTHER INFORMATION: / Ceres Seq. ID 2055693 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 6 : 
atgcaaaaac ctctctctct ctctcaattc acttctcctc cttcaatggc ttggcgcaac 60 
35 gcaggatctg ccgctcgttc tttcgtctcc gccaccgcaa gatcaccatc tctccgttct 120 
cccaccacgg cgcttcctcg cctccgtcct cctcaatcct ccttacctag ccgtcgcttc 180 
accttctcat caccttccag gaatctagga gcacttggtt gcacacagtc gttcttgcct 240 
ctgtacagtg ttgtggctac ttctcaactc acatctcacc ttaacgttaa cttgcgagct 300 
ttctgcgagc tgtctaacgg gaattggaaa agatgggtga tgcactcgga ttttacgtcg 360 
4 0 ctggcaagag aatggtgctg cattgttccg tga 
(2) INFORMATION FOR SEQ ID NO: 157: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
4 5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
Hi) MOLECULE TYPE: peptide 
;ix) FEATURE: 

(A) NAME/KEY: peptide 
50 (B) LOCATION: 1..130 

(D) OTHER INFORMATION: / Ceres Seq. ID 2055694 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
Met Gin Lys Pro Leu Ser Leu Ser Gin Phe Thr Ser Pro Pro Ser Met 
1 5 10 15 

55 Ala Trp Arg Asn Ala Gly Ser Ala Ala Arg Ser Phe Val Ser Ala Thr 

2 0 2 5 30 

Ala Arg Ser Pro Ser Leu Arg Ser Pro Thr Thr Ala Leu Pro Arg Leu 

35 40 45 

Arg Pro Pro Gin Ser Ser Leu Pro Ser Arg Arg Phe Thr Phe Ser Ser 
60 50 55 60 

Pro Ser Arg Asn Leu Gly Ala Leu Gly Cys Thr Gin Ser Phe Leu Pro 
65 70^ ~ 75 80 
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Leu Tyr Ser Val Val Ala Thr Ser Gin Leu Thr Ser His Leu Asn Val 

85 90 95 

Asn Leu Arg Ala Phe Cys Glu Leu Ser Asn Gly Asn Trp Lys Arg Trp 
100 105 ' 110 

5 Val Met His Ser Asp Phe Thr Ser Leu Ala Arg Glu Trp Cys Cys lie 
115 120 125 

Val Pro 
130 

(2) INFORMATION FOR SEQ ID NO: 158: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1 . . 115 

(D) OTHER INFORMATION: / Ceres Seq. ID 2055695 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



Met 


Ala 


Trp 


Arg 


Asn 


Ala 


Gly 


Ser 


Ala 


Ala 


Arg 


Ser 


Phe 


Val 


Ser 


Ala 


1 








5 










10 










15 




Thr 


Ala 


Arg 


Ser 


Pro 


Ser 


Leu 


Arg 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Pro 


Arg 








20 










25 










30 






Leu 


Arg 


Pro 


Pro 


Gin 


Ser 


Ser 


Leu 


Pro 


Ser 


Arg 


Arg 


Phe 


Thr 


Phe 


Ser 






35 










40 










45 








Ser 


Pro 


Ser 


Arg 


Asn 


Leu 


Gly 


Ala 


Leu 


Gly 


Cys 


Thr 


Gin 


Ser 


Phe 


Leu 




50 










55 










60 










Pro 


Leu 


Tyr 


Ser 


Val 


Val 


Ala 


Thr 


Ser 


Gin 


Leu 


Thr 


Ser 


His 


Leu 


Asn 


65 










70 










75 










80 


Val 


Asn 


Leu 


Arg 


Ala 


Phe 


Cys 


Glu 


Leu 


Ser 


Asn 


Gly 


Asn 


Trp 


Lys 


Arg 










85 










90 










95 




Trp 


Val 


Met 


His 


Ser 


Asp 


Phe 


Thr 


Ser 


Leu 


Ala 


Arg 


Glu 


Trp 


Cys 


Cys 








100 










105 










110 







35 lie Val Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 159: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

4 5 (A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 68 

(D) OTHER INFORMATION: / Ceres Seq. ID 2055696 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
Cys Lys Asn Leu Ser Leu Ser Leu Asn Ser Leu Leu Leu Leu Gin Trp 

5 0 1 " 5 10 15 

Leu Gly Ala Thr Gin Asp Leu Pro Leu Val Leu Ser Ser Pro Pro Pro 

20 25 30 

Gin Asp His His Leu Ser Val Leu Pro Pro Arg Arg Phe Leu Ala Ser 
35 4 0 45 

55 Val Leu Leu Asn Pro Pro Tyr Leu Ala Val Ala Ser Pro Ser His His 
50 55 60 

Leu Pro Gly lie 
65 

(2) INFORMATION FOR SEQ ID NO: 160: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..394 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056405 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 



tcaatagctt atctagttct tgactgttca acgatcaatg gcaccatccg ctgcaatgct 6 

catactctca catcctctag ttagccacaa agctaagaat cagtctctgt catcgccgtc 12 

gtctgttaag tcgacacgtg tcttcggttt tctttggcct tggaaggcat tagacaatga 18 

ggatcattcc gcggttgttc tcggccggct ctttggcgat ccggctacta tcgagaagcg 24 

cttccaagaa gctcttgaac aaagctgttg gtaatttgta caatgtttgg ttgggtttat 30 

tgattgtttt tacgttaaaa tagcttttat aaattggaaa tggaagtact gtaaaatgta 36 



aaaattgact atatataatt aaaggtacat tatg 
(2) INFORMATION FOR SEQ ID NO: 161: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 78 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056406 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



Met 


Ala 


Pro 


Ser 


Ala 


Ala 


Met 


Leu 


He 


Leu 


Ser 


His 


Pro 


Leu 


Val 


Ser 


1 








5 










10 










15 




His 


Lys 


Ala 


Lys 


Asn 


Gin 


Ser 


Leu 


Ser 


Ser 


Pro 


Ser 


Ser 


Val 


Lys 


Ser 








20 










25 










30 






Thr 


Arg 


Val 


Phe 


Gly 


Phe 


Leu 


Trp 


P: o 


Trp 


Lys 


Ala 


Leu 


Asp 


Asn 


Glu 






35 










40 










45 








Asp 


His 


Ser 


Ala 


Val 


Val 


Leu 


Gly 


Arg 


Leu 


Phe 


Gly 


Asp 


Pro 


Ala 


Thr 




50 










55 










60 










He 


Glu 


Lys 


Arg 


Phe 


Gin 


Glu 


Ala 


Leu 


Glu 


Gin 


Ser 


Cys 


Trp 






65 










70 










75 












(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 162: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . .72 

(D) OTHER INFORMATION: / Ceres Seq. ID 2056407 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 



Met 


Leu He 


Leu 


Ser 


His 


Pro Leu Val 


Ser 


His 


Lys 


Ala 


Lys 


Asn 


Gin 


1 






5 






10 










15 




Ser 


Leu Ser 


Ser 


Pro 


Ser 


Ser Val Lys 


Ser 


Thr 


Arg 


Val 


Phe 


Gly 


Phe 






20 






25 










30 






Leu 


Trp Pro 


Trp 


Lys 


Ala 


Leu Asp Asn 


Glu 


Asp 


His 


Ser 


Ala 


Val 


Val 




35 








40 








45 








Leu 


Gly Arg 


Leu 


Phe 


Gly 


Asp Pro Ala 


Thr 


He 


Glu 


Lys 


Arg 


Phe 


Gin 




50 








55 






60 










Glu 


Ala Leu 


Glu 


Gin 


Ser 


Cys Trp 
















65 








70 


















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 163: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 1..4 8 

(D) OTHER I N FORMAT I ON : / Ceres Seq. ID 2056408 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
10 Met Arg lie lie Pro Arg Leu Phe Ser Ala Gly Ser Leu Ala lie Arg 
1 5 10 " 15 

Leu Leu Ser Arg Ser Ala Ser Lys Lys Leu Leu Asn Lys Ala Val Gly 

20 ~ 25 30 

Asn Leu Tyr Asn Val Trp Leu Gly Leu Leu lie Val Phe Thr Leu Lys 
15 35 40 45 

(2) INFORMATION FOR SEQ ID NO: 164: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 66 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : - 
2 5 (B) LOCATION: 1..766 

(D) OTHER INFORMATION: / Ceres Seq. ID 2065747 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
acagtcgcct ccttcctgcg ccgccgccgc cgccgccgct accaccgcct cgcggccgcc 60 
tgcggctccg cgtttccgca cggatttggt gaaggtttct tgaggctgag aacatggtgc 120 
30 tgaagacgga actttgccgc ttcagcggcc agaagattta tcctgggaaa ggcattagat 180 
ttatccgtgc tgattctcag gtcttccttt ttgccaactc gaaatgcaag cgctacttcc 240 
acaaccgcct gaagcctgca aagcttacct cgacagcaat gtacaggaag cagcacaaga 300 
aggatatcca tgctgaagcg gtaaagaaga ggcgccgcgc caccaagaag ccatactcca 360 
ggtcaattgt gggtgcttcc ttggaagtaa tccagaagaa gagagctgag aagccagagg 420 
35 tccgcgatgc tgctagagaa gctgctcttc gtgagatcaa ggagcgcatc aagaagacca 480 
aggatgagaa gaaagcgaag aaggcggagg tgagcaagtc ccagaagacg cagacaaagg 54 0 

gtgcggtcca gaagggttcc aagggcccca agttgggcgg cggtggtggg aagcgctgaa 600 
agaacttagt gtcgtttctc gacattgcag tcgttcctta gccaaagcca ctttcgtaga 660 
acctgtgttg aatttgcaag acttattcaa gcgttgcttg tgcgtgctaa ataccatggc 720 
40 aagagaacgg atttatattt atgcctgaaa aaaaatgacc gttcat 
(2) INFORMATION FOR SEQ ID NO: 165: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

4 5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : peptide 

5 0 (B) LOCATION: 1..161 

(D) OTHER INFORMATION: / Ceres Seq. ID 2065748 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
Met Val Leu Lys Thr Glu Leu Cys Arg Phe Ser Gly Gin Lys lie Tyr 
1 5 10 15 

55 Pro Gly Lys Gly He Arg Phe He Arg Ala Asp Ser Gin Val Phe Leu 

20 25 30 

Phe Ala Asn Ser Lys Cys Lys Arg Tyr Phe His Asn Arg Leu Lys Pro 

35 4 0 4 5 

Ala Lys Leu Thr Trp Thr Ala Met Tyr Arg Lys Gin His Lys Lys Asp 
60 50 5 5 60 

lie His Ala Glu Ala Val Lys Lys Arg Arg Arg Ala Thr Lys Lys Pro 
65 70 75 80 



WO 00/40695 



PCT/US00/00466 



666 



i yr 


Se r 


Airg 


Se it 


i ± e 


Vdl 


L?±y 
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Lys 


D y~ 
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r\J. ci 
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100 










105 










110 




Arg 


Glu 


He 


Lys 


Glu 


Arg 


He 


Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys Ala 






115 










120 
















Lys 


Lys 


Ala 


Glu 


Val 


Ser 


Lys 


Ser 


Gin 


Lys 


Thr 


Gin 


Thr 


Lys 


Gly Ala 




130 










135 










140 








Val 


Gin 


Lys 


Gly 


Ser 


Lys 


Gly 


Pro 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gly Lys 


145 










150 










155 








160 


Arg 






























( 2 } 


I NT pi 


ORMATION 


FOR 


SEQ 


ID 


NO: 166 : 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



ii) MOLECULE TYPE: peptide 
ix) FEATURE: 

(A) NAME / KEY : peptide 

(B) LOCATION: 1 . .106 

(D) OTHER INFORMATION: / Ceres Seq. ID 2065749 





C xi ) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO : 1 6 6 : 










Met 




Arg 


Lys 


Gin 


His 


Lys 


Lys 


Asp 


He 


His 


Ala 


Glu 


Ala 


Val 


Lys 


1 








5 










10 










15 




Lys 


Arg 


Arg 


Arg 


Ala 


Thr 


Lys 


Lys 


Pro 


Tyr 


Ser 


Arg 


Ser 


He 


Val 


Gly 








20 










25 










30 






Ala 


Ser 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Lys 


Arg 


Ala 


Glu 


Lys 


Pro 


Glu 


Val 






35 










40 










45 








Arg 


A.sp 


Ala 


Ala 


Arg 


Glu 


Ala 


Ala 


Leu 


Arg 


Glu 


He 


Lys 


Glu 


Arg 


He 




50 










55 










60 










Lys 


Lys 


Thr 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 


Lys 


Lys 


Ala 


Glu 


Val 


Ser 


Lys 


65 










70 










75 










80 


Ser 


Gin 


Lys 


Thr 


Gin 


Thr 


Lys 


Gly 


Ala 


Val 


Gin 


Lys 


Gly 


Ser 


Lys 


Gly 










85 










90 










95 




Pro 


Lys 


Leu 


Gly 


Gly 


Gly 


Gly 


Gly 


Lys 


Arg 
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CLAIMS 

What is claimed is: 

1. An isolated nucleic acid molecule comprising a 
nucleic acid having a nucleotide sequence selected from the 
group consisting of 

(i) a nucleotide sequence shown in SEQ TABLES 1 AND 2; 

(ii) a nucleotide sequence complementary to one shown in 
SEQ TABLES 1 AND 2; 

(iii) a nucleotide sequence which is the reverse 
complement of one shown in SEQ TABLES 1 AND 2; 

(iv) a nucleotide sequence capable of hybridizing to a 
nucleotide sequence shown in SEQ TABLES 1 AND 2/ 

(iv) a nucleotide sequence capable of hybridizing to a 
nucleotide sequence complementary to one shown in SEQ TABLES 1 
AND 2; 

(vi) a nucleotide sequence capable of hybridizing to a 
nucleotide sequence which is the reverse complement of one 
shown in SEQ TABLES 1 AND 2; 

whereby the hybridization in groups (iv) to (vi) allow 
said sequences to form a duplex at a temperature from about 
Tm-40°C to about Tm-48° C. 



2. An isolated nucleic acid molecule according to claim 
1 that comprises a complete open reading frame. 



3. An isolated nucleic acid molecule according to claim 
1 that functions as a promoter or as a 3' end termination 
sequence or as a regulator sequence influencing the 
transcription rate, the transcript stability or RNA 
translation rate in a host cell. 
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4. The isolated nucleic acid of claim 3, comprising a 
sequence selected from the group consisting of a TATA box 
sequence, a CAAT box sequence, a motif of GCAATCG or a 
transcript ion- factor binding sequence • 

5. An isolated nucleic acid molecule comprising a nucleic acid 
sequence that encodes an amino acid sequence selected from the 
group consisting of the amino acid sequences shown in SEQ 
TABLES 1 AND 2, a complementary nucleotide sequence to said 
nucleic acid sequence that encodes said amino acid sequence 
selected from SEQ TABLES 1 AND 2, a reverse complementary 
nucleotide sequence to said nucleic acid sequence that encodes 
said amino acid sequence selected from SEQ TABLES 1 AND 2, and 
a nucleotide sequence that will hybridize to said nucleic acid 
sequence encoding an amino acid selected from SEQ TABLES 1 AND 
2 or said complementary sequence under hybridization conditions 
providing Tm - 40°C to Tm - 48°C. 

6. An isolated nucleic acid molecule comprising a 
nucleic acid sequence that encodes at least a part or a portion 
or a mutant or a fusion of an amino acid sequence selected from 
the group consisting of the amino acid sequences shown in SEQ 
TABLES 1 AND 2, a complementary nucleotide sequence to said 
nucleic acid sequence that encodes said part or portion or 
mutant or fusion of an amino acid sequence selected from SEQ 
TABLES 1 AND 2, a reverse complementary nucleotide sequence to 
said nucleic acid sequence that encodes said part or portion or 
mutant or fusion of an amino acid sequence selected from SEQ 
TABLES 1 AND 2, and a nucleotide sequence that will hybridize 
to said nucleic acid sequence that encodes said part or portion 
or mutant or fusion of an amino acid selected from SEQ TABLES 
1 AND 2 or said complementary sequence or said reverse 
complementary sequence under hybridization conditions providing 
Tm - 40°C to Tm - 48°C. 



WO 00/40695 



PCT/US00/00466 



669 

7. An isolated nucleic acid molecule according to claim 
I, having a nucleotide sequence selected from the group 
consisting of the nucleotide sequences shown in SEQ TABLES 1 
AND 2, and a complementary nucleotide sequence to said 

5 nucleotide sequence selected from SEQ TABLES 1 AND 2. 

8. A chimeric DNA construct comprising (a) a promoter 
sequence capable of driving gene expression in plant cells and 
operatively linked to (b) a structural gene comprising an DNA 
molecule according to any one of claims 1, 2, 5 and 7. 

9. A chimeric DNA construct comprising (a) a promoter 
that is a nucleic acid molecule according to claim 3 or 4 
operatively linked to (b) a structural gene and, optionally, 
(c) an associated UTR. 

10. An isolated DNA molecule comprising (a) a promoter 
sequence according to claim 3 or 4 and operatively linked to 
(b) a structural gene sequence according to any one of claims 
1, 2, 5, 6 and 7 . 

11. A promoter according to claim 10, wherein said 
promoter is a specific promoter. 

12. A promoter according to claim 11, wherein said 
promoter is a seed-specific promoter, an embryo-specific 
promoter, an ovule-specific promoter, a tapetum-specif ic 
promoter or a root-specific promoter. 

13. A host cell transformed with a polynucleotide 
comprising the isolated nucleic acid molecule according to 
claim 1. 
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14. An isolated polypeptide comprising an amino acid 
sequence encoded by a polynucleotide sequence shown in SEQ 
TABLES 1 AND 2, or one that is at least 75% identical thereto. 



15. An isolated polypeptide according to claim 14, that 
is at least 85% identical to said amino acid sequence. 

16. An isolated polypeptide, according to claim 15, that 
is at least 90% identical to said amino acid sequence. 

17. A polynucleotide comprising a first polynucleotide 
sequence from SEQ TABLES 1 AND 2 or a fragment thereof, wherein 

said first sequence is capable of regulating transcription or 

translation of second a polynucleotide comprising a coding 
sequence in a host cell or in vitro system wherein the first 
and second polynucleotides are operatively linked. 



18. The first polynucleotide of claim 17, which is 
capable of regulating transcription or translation in an in 
vitro system. 



19. The first polynucleotide of claim 17, which is 
capable of regulating transcription or translation in a host 
cell . 

20. The polynucleotide of claim 19, wherein said host 
cell is a plant cell. 



21. A host cell of claim 13, wherein said isolated 
nucleic acid comprises a promoter and operatively linked 
structural gene and further wherein said promoter and 
structural gene are not heterologous to each other and are 
exogenous to the host cell genome. 
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22. A method of introducing an isolated nucleic acid into 
a host cell comprising: 

(a) providing an isolated nucleic acid of any of claims 
1-12; 

5 (b) contacting said isolated nucleic acid with said host 

cell under conditions that permit insertion of said nucleic 
acid into said host cell. 

23. A method of producing a polypeptide of any one of 
claims 14-16 comprising: 

(a) providing a host cell of claim 13; 

(b) culturing said host cell under conditions that permit 
5 transcription and translation of said structural gene to 

produce a polypeptide; and 

(c) isolating said polypeptide. 

24. An antibody capable of binding to a polypeptide of 
any one of claims 14-16. 

25. An isolated nucleic acid comprising a promoter of a 
gene wherein said gene comprises a nucleic acid having the 
nucleotide sequence of claim 1. 

26. An isolated polypeptide comprising an amino acid 
sequence selected from SEQ TABLES 1 AND 2, or one that is at 
least 75% identical thereto. 

27. The isolated polypeptide according to Claim 26 that 
is at least 85% identical thereto. 

28. The isolated polypeptide according to claim 26 that 
is at least 90% identical thereto. 



